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Abstract 

Given an integer k, we consider the parallel ^-stripping process applied to a hyper¬ 
graph H: removing all vertices with degree less than k in each iteration until reaching 
the fc-core of H. Take H as T-Lr{n,m): a random r-uniform hypergraph on n vertices 
and m hyperedges with the uniform distribution. Fixing k,r >2 with {k,r) ^ (2,2), 
it has previously been proved that there is a constant such that for all m = cn 
with constant c ^ Cr,k, with high probability, the parallel /c-stripping process takes 
O(logn) iterations. In this paper we investigate the critical case when c = Cr^k + o(l). 
We show that the number of iterations that the process takes can go up to some power 
of n, as long as c approaches sufficiently fast. A second result we show involves 
the depth of a non-/c-core vertex v. the minimum number of steps required to delete v 
from 7ir{n,m) where in each step one vertex with degree less than k is removed. We 
will prove lower and upper bounds on the maximum depth over all non-Zc-core vertices. 


1 Introduction 


Given a nonnegative integer k and a (hyper)graph H, the Ucore of H, denoted by Ck{H), is 
the maximum subgraph of H with minimum degree at least k. The fc-core was first studied 
by Bollobas[5] and has since become a major focus in random graph theory. Its many appli¬ 
cations include erasure codes[25, 26], colouring [31], hashing[19], and graph orientability[18, 
7, 13, 12]. We define 'Hr{n,m) to be a random r-uniform hypergraph on vertex set [n] := 
{1, 2,..., n} and exactly m hyperedges, with the uniform distribution. We focus on sparse 
random hypergraphs with bounded average degree; thus the typical range of focus is m = 
0(n). The threshold for the appearance of a non-empty /c-core in 'Hr{n,cn) was first de¬ 
termined by Pittel, Spencer and Wormald [34] for r = 2 and k > 3, whereas for general 
(r, k) 7 ^ (2, 2), the threshold is determined [30, 24] to be: 


Cr,k 
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The k-core of a (hyper)graph H can be obtained by repeatedly removing all vertices with 
degree less than k. In the parallel fc-stripping process, all vertices with degree less than k 
are removed at once in each iteration until the fc-core is reached. The number of rounds 
this process takes is called the k-stripping number, denoted by Sk{H). This number is an 
important parameter associated with a (hyper)graph and its /c-core. Jiang, Mitzenmacher 
and Thaler [23] discussed several applications of the parallel stripping process in computer 
science and the importance of analysing the /c-stripping number. An upper bound O(logn) 
is given in [1] for SkiTirin, cn)) when c > Cr^k + e and this bound is proved to be tight in [23]. 
Independently, [21] proved an upper bound of poly(logn). On the other hand, it was proved 
in [23, 15] that Sk{'Hrin,cn)) = O(loglogn) if c < Cr^k — e (in fact, results in [1, 23, 15] are 
presented for 'Hr-{n,p), p = but they easily translate to 'Hr{n,m) as well). However, 

as we will prove in this paper, the stripping process can take a long time as c —)• Cr^k- the 
number of rounds needed can go up to some power of n, depending on the rate at which c 
approaches Cr^k- 

We give some intuitive explanation of this phenomenon. Note that Cr^k is the emergence 
threshold of the /c-core. When c —)■ Cr^k, the parallel /c-stripping process undergoes a bottle¬ 
neck. At a certain point, the number of vertices removed in each iteration (i.e. of degree less 
than k) becomes sublinear in n. If there is no /c-core, then the process continues for a long 
enough time to eventually pass through the bottleneck: the number of vertices removed in 
each iteration becomes and stays linear in n again and eventually all vertices of the hyper¬ 
graph will be removed, producing an empty /c-core. If there is a /c-core, then the process 
terminates after only o{n) total vertices are removed during the bottleneck; what remains 
is the giant /c-core. In both cases, the number of remaining vertices with degree less than k 
mimics a random walk whose expected change with each vertex deletion is very close to zero 
during the bottleneck. Hence a large number of vertices must be removed for it to either 
decrease to 0 or increase to linear in n. Since few vertices are removed in each iteration, it 
takes many iterations to remove this large number of vertices. 

The analysis for the stripping number when c —)■ Cr^k becomes subtle. For c inside the 
critical window \c — Cr^k\ = 0{l/y/n), it is not certain that T-Lrin, cn) has a non-empty /c-core; 
i.e. the probability that T-Lrin, cn) contains a non-empty /c-core is bounded away from 0 and 1 
as n oo [22]. This brings in certain complications in our analysis of the stripping number, 
especially for the upper bound. We will prove a lower bound for the stripping number when 
c < Cr^k + (note this range contains the critical window), whereas the upper bound 

for c in this range will be studied in a subsequent paper. For c above Cr^k + we will 

prove both upper and lower bounds that are tight in the asymptotic order. It is interesting 
to note that several graph parameters/properties related to the /c-core have a radical change 
when c approaches to this threshold, including the robustness of the /c-core [36]. 

Rather than removing all vertices with degree less than k at once in each iteration, we 
may remove one vertex at a time. This produces a stripping sequence: a sequence of vertices 
removed from the initial hypergraph H . Vertices in a stripping sequence can appear in an 
arbitrary order, as long as each vertex has degree less than k at the moment of its removal. 
Note that a stripping sequence does not necessarily need to terminate with the /c-core: for a 
stripping sequence T, a subsequence of T composed of the first arbitrary number of vertices in 
T is also a stripping sequence. Given a non-/c-core vertex v, we are interested in the length of 
a shortest stripping sequence ending with v. roughly speaking, this is the minimum number 
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of vertices required to be deleted before deleting v. This number is called the depth of v. A 
formal dehnition is given in Section 2. The maximum depth of 'Hr{n^ m) (over all non-/c-core 
vertices) is bounded in [1] by O(logn) when c differs from Cr^k by some absolute constant. In 
this paper, our goal is to bound (both from above and below) this parameter when c —>■ Cr^k- 
The aforementioned bounds on the depth and stripping number from [1, 21] were moti¬ 
vated by applications to solution clustering in random XORSAT. We will provide analogous 
applications for our present results in a subsequent paper (a preliminary version containing 
partial results on their applications to clustering in random XORSAT is available in [17]). 


2 Main results 


We hrst give a formal dehnition of the parallel /c-stripping process and the fc-stripping 
number. 

Definition 1. The parallel k-stripping proeess, applied to a hypergraph H, consists of itera¬ 
tively removing all vertices of degree less than k at once along with any hyperedges containing 
any of those vertices, until no vertices of degree less than k remain; i.e. until we are left with 
the k-coie of H. We use S', to denote the vertices that are removed during iteration i. We use 
Hi to denote the hypergraph remaining after i — 1 iterations, i.e. after removing Si,Si-i. 

We will analyze the number of rounds that this process takes: 

Definition 2. The k-stripping number of H, denoted Sk{H), is the number of iterations 
in the parallel fc-stripping process applied to H. We often drop the “A:” and speak of the 
stripping number and s{H). 

We use the following standard notation. Given a sequence of probability spaces fin, we 
say a sequence of events An occurs asymptotically almost surely (a.a.s.) if the probability 
that An occurs in tends to 1 as u —)■ cx). With two sequences of real numbers On and bn, 
we use a„ = 0(6„) to denote that there is an absolute constant C such that |a„| < C\hn\- 
We write a„ = o{hn) if lim„_^oo On/^n = 0. We write a„ = fl{bn) if bn = 0(an) and On, bn > 0 
eventually; a„ = 0(bn) if an = 0{bn) and bn = 0(a„) and an,bn > 0 eventually. Thus, if 
On = fl(bn) or On = Q{bn) then On, bn must both be positive (for large n). All asymptotics 
in this paper refer to u ^ cxo. 

We hrst present our main result on the stripping number of 'Hr{n, cn). 

Theorem 3. Consider constants r,k >2, (r, k) ^ (2, 2) and 0 < 5 < 1/2. 

(a) If c = Cr^k +then a.a.s. s('Hj.(n, cn)) = 0(n^/^ log u). 

(b) If \c — Cr,k\ < n~^ then a.a.s. s{'Hrin, cn)) = Il{n^^'^). 

So if we roughly dehne the following three ranges of c, the behaviour of the stripping 
number is as follows: 

• c = Cr^k + 'a~^, 0 < (5 < 1/2. Then the stripping number is 0(n^/^ logn). 
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• c = Cr^k — n 0<5<l/2. Then the stripping number is at least 

• c is between Cr^k — and Cr^k + Then the stripping number is 

p^(^l/4+o(l))_ 

In the hrst range, the stripping number is specihed within a constant factor. In the other 
two ranges, we do not say anything about upper bounds on the stripping number; those 
upper bounds will be studied in a subsequent paper. 

Remark. With the same proof, n~^ in Theorem 3 (and in all relevant lemmas in Sections 3 
and 4) can be replaced by (and correspondingly is replaced by ^/T[^ for any = 
o(l) such that for some constant e > 0 (e.g. = 1/ logn or logn). 

We use the less general statement for a cleaner presentation. 

We are also interested in the number of vertices that must be deleted in order to remove 
a particular vertex v. he. what is the smallest number of vertices that must be deleted in 
order to demonstrate that v is not in the /c-core? 

Definition 4. A k-stripping sequence is a sequence of vertices that can be deleted from a 
hypergraph, one-at-a-time, along with their incident hyperedges such that at the time of 
deletion each vertex has degree less than k. For any vertex v not in the fc-core, the depth of 
V is the length of a shortest /c-stripping sequence ending with v. 

While every non-fc-core vertex has depth O(logn) for any constant c > Ck^r, as proved 
in [1], one of our main contributions in this paper is to prove that when c approaches the 
k-coie emergence threshold, the maximum depth can rise to 

Theorem 5. Let r,/c > 2, {r,k) ^ (2,2) he fixed. There are constants Z = Z{k,r) and 
K = K{k,r) such that: for any fixed 0<5<l/2, z/c = Cr^k + then a.a.s. the maximum 
depth of all non-k-core vertices in 'Hr{n,cn) is between Zn^!"^ and . 

Remark. 

(a) Again here and in Theorem 41 in Section 5 below, with the same proof, n~^ above (and 

in all relevant lemmas in Sections 5 and 6) can be replaced by fn for any = o(l/log^n) 
and fn > for some constant e > 0. We did not try to optimize the power of logn. 

(b) For the upper bound in this theorem, we will actually prove a stronger statement, because 
the stronger statement will be used in another paper on solution clustering in random XOR- 
SAT. See Theorem 41 in Section 5. 

(c) Both Theorems 3 and 5 translate to 'Hr{n,p) with p = r\c/n^~^. Translations of a.a.s. 

properties from to Tifin^p) are usually standard by conditioning the number of 

hyperedges X in Tifin^p) on its typical values, as long the properties under consideration 
hold for all m such that |m—EW| = 0(\/VarX). Our situation is a little subtle as our bounds 
on the stripping number depend on how close c is to and some of our results cover the 
case that \{c — Cr^k)n\ is smaller than the standard deviation of X, e.g. if \c — Cr^k\ = o{n~^l‘^). 
The reason that the translation holds in this paper is that for c in the range in Theorem 3(b), 
the lower bound is uniform for all c and the size of the range is of order n^~^, which is much 
greater than the deviation of X, for any S < 1/2. 

The fc-stripping number clearly bounds the maximum depth from below, and so the lower 
bound of from Theorem 3 implies the lower bound in Theorem 5. The difficult part 

of the proof of Theorem 5 is to show the upper bound. 
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Our requirement that c > Cr^k + in Theorem 5 comes from the similar restriction 

on the actual appearance of the fc-core, as shown by Kim[24] (see also [9]; a more precise 
statement of Theorem 6(b) is given in Lemma 7 in Section 3): 

Theorem 6. [ 24 ] For r, fc > 2, (r, k) ^ (2, 2) and for any constant e > 0. 

(a) For c < Cr^k — a.a.s. the k-core ofFirin^cn) is empty. 

(b) For c > Cr,k + a.a.s. Firin, cn) has a k-core with 0(n) vertices. 

The most challenging difference between the setting of this paper and that of [1] is: when 
c = Cr^k + 0(1)) number of vertices with degree less than k remaining after each round 
of the parallel stripping process drops geometrically (see the comment following Lemma 16). 
That easily implies that a.a.s. the stripping number is O(logn). Furthermore, the property 
which implies that this number drops geometrically is also key in the analysis of [1] proving 
that the depth is O(logn). For c = c^^k + the number of such vertices drops much more 
slowly, leading to an increase in the stripping number and hence in the maximum depth, 
which is easily seen to be bounded from below by the stripping number. This requires us to 
use a very different, and much more intricate, analysis. 

One novelty in our analysis of the depth is as follows: The most straightforward approach 
to analyze the stripping process is to repeatedly expose the vertices and edges removed in 
each iteration. Instead, we only expose the vertex sets S'i,S' 2 ,.... The advantage is that, 
when considering the depth of a vertex u in a particular level S'*, we can treat the edges 
removed in previous iterations as random. Thus even though we have exposed the fact that 
V is in S'*, there is still some randomness that we can make use of in the specihc sequence of 
vertices whose deletions led to v being deleted in iteration i. Of course, exposing the vertex 
sets that are deleted in each step exposes something about the edges that are deleted, so we 
have to be careful about the conditional distribution of those edges. The details of how we 
do this are in Section 5.1. 

We excluded the case (r, k) = (2, 2) from our main results as this case has already been 
extensively studied. The 2-core of the random graph G{n, cn) is well-studied[27, 28, 35, 10, 
11, 14]. For 0 < (5 < I and p = 1 + n~^ a.a.s. the largest component consists of a large 
2-core C 2 , with a Poisson Galton-Watson tree of branching parameter 1 — rooted at each 
vertex. In addition, there are 0(n) smaller components distributed essentially like Poisson 
Galton-Watson trees of branching parameter 1 — n~^, except that some of them contain a 
single cycle. We can dehne the stripping number to be the stripping number of the largest 
component, or to be the maximum of the stripping number of all components. Either way, 
it is of the order of the height of the tallest of those trees which is n^poly(logn). Similarly, 
the maximum depth is of the order of the size of the largest tree, which is ?7,^‘^poly(logn). 

We will prove Theorem 3(b), and the upper bound for Theorem 3(a) in Section 4, along 
with a weaker version of the lower bound for Theorem 3(a). The lower bound for Theo¬ 
rem 3(a) is proved in Section 6. We prove Theorem 5 in Section 5. In Section 3 we discuss 
the size of the k-core as a preparation for the analysis in Sections 4 - 6. A key lemma used 
in the proof of Theorems 3(a) and 5 will be presented in Section 6. 
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3 Size of the k-core 


We give a more precise version of Theorem 6(b), where the size of the k-core is specihed, 
when c > Cr^k + We start by dehning: 


/t(h) 







( 2 ) 

(3) 


Note that /t(/i) is the probability that a Poisson variable with mean p, denoted by Po(/i), 
is at least t. Thus by (1), 


^r,k 


mffM. 

M>o r 


Now for any r, fc > 2, (r, k) ^ (2, 2), we dehne pr.fc to be the value of /i that minimizes h(/i); 

i.e. 

Iir,k is the unique solution to h{fir,k) = rcr^k (4) 


Dehne 


Q, Cy,r,k fk(^k'r,k)y Pr,k ^k'r,kfk—l(^k'r,k')- 


(5) 


For ease of notation, we drop most of the r, k subscripts. For any c > Cr,fc, we dehne /i(c) 
to be the larger solution to 

h(/i) 


c = 


Then, fir,k = h(cr,fc)- Dehne 


«(c) = /fc(h(c)), /3(c) = -/i(c)/fc_i(p(c)). 

r 


( 6 ) 


Theorem 1.7 of [24] yields the size of the k-core as follows (Theorem 1.7 of [24] is for 
'Hr{n,p) but it easily translates to T-Lrin, m) by a standard coupling argument for 'Hr{n,p) and 
'Hr{'n,m), and by the fact that having a non-empty k-core is an increasing graph property): 

Lemma 7. Fix r, k > 2,{r,k) 7 ^ (2,2) and e > 0. If c > Cr^k + n then a.a.s. the 
k-core of'Hr{'n,cn) has 

• a{c)n -I- 0(n^/^) vertices and 

• /3(c)n-I-0(n^/^) edges. 

While the proof of [24, Theorem 1.7] (i.e. Lemma 7) is very technical, we give a brief 
heuristic explanation of why Cr^k is the emergence threshold for a non-empty k-core and why 
the fc-core of Hrin, cn) is expected to contain around a{c)n vertices, when c > Cr^k- For vertex 
V G Hr{n,cn), let Xt denote the probability that v survives after t iterations of the parallel 
stripping process. Intuitively, A := lim^^oo Ai is the probability that v is in the k-core of 
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T-Lrin, cn) and thus the fc-core is expected to contain An vertices. It is easy to prove that the 
neighbourhood of v locally converges in distribution to a hyper-tree generated by the Poisson 
branching process with parameter cr starting at v. Then, v survives after t iterations of the 
parallel stripping process if and only if v is incident with at least k hyperedges surviving 
from t — 1 iterations of the stripping process, i.e. for each of these surviving hyperedges x, 
all the other r — 1 vertices in x are incident with at least k — 1 other hyperedges that survive 
after t — 2 iterations of the stripping process, and so on. To compute Xt, let pt denote the 
probability that v is incident with at least k — 1 hyperedges surviving after t iterations of the 
stripping process. As mentioned before, the number of hyperedges incident with v is Po(cr). 
Each hyperedge survives after t — 1 iterations of the stripping process with probability p^Zl- 
It is then easy to prove that the number of hyperedges incident with v that survive after 
t — 1 iterations of the stripping process is Po(p[ricr). Hence, we derive the recursion for pt 
and Ap 


Po — 1; 

Pt = Pr(Po(p[AVr) > /c - 1); 

Xt = Pr(Po(p[riCr) > k). 

Put A = Pr(Po(p^“^cr) > P), where p satishes p = Pr(Po(p^“^cr) > k — 1). It is then easy 
to show that A = p = 0ifc< Cr^k and thus, the heuristics above implies that the fc-core is 
likely to be empty. For c > Cr^k, h is easy to show that pt converges to the larger root of 
p = Pr(Po(p''“^cr) >k — l) and then A gives a(c) in (6). The expected sum of the degrees 
is n X Ei>fc*Pr(Po(p" ^ cr) = i) which is 2(3. 

By a close analysis of Q!(c) and (3{c) we have: 

Lemma 8 . Fix r,k >2, (r, k) ^ (2, 2) and 0 < <5 < 1/2. There exist positive constants Ki, 
K 2 and such that if c = Cr^k + then 

p{c) - pr,k = + 0{n~^), 

a{c) — a = K 2 n~^^‘^ -|- 0{n~^), 

I3{c)-I3 = + 0{n-^). 

We present its proof in the Appendix. 

Lemma 7, along with Lemma 8 above, together with the fact that pfk-i{p)/fk{p) is an 
increasing function on /i > 0 (see Lemma 27 below), yields the following bounds on the size 
of the k-core: 

Corollary 9. Fix r,k > 2, (r, k) 7 ^ (2, 2) and 0 < 5 < 1/2. There exist positive constants 

Ki = Ki{r, k), K 2 = K 2 {r,k) and = K^^Vjk): if c = Cr^k + n~^ then a.a.s. the k-core of 

Hr (n, cn) has 

(a) an -\- Kin^~^/‘^ -|- -T n^^^) vertices and 

(b) (dn + K 2 n}~^l‘^ F 0{n^~^ + n^/^) edges and 

(c) average degree rfd/a + -|- 0{n~^ -|- n~^^‘^). 
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We close this section by introducing two terms that are standard when analyzing the 
fc-core stripping process: 

Definition 10. The light vertices of a hypergraph are the vertices of degree less than k. The 
heavy vertices are those of degree at least k. 

4 Bounding the stripping number 


We restate the parallel stripping process in a slowed-down version, which will be more 
convenient to analyze. Rather than removing all vertices of Si at once, we remove them one 
at a time. When removing a vertex, we slow down further by removing one edge at a time. 

To facilitate this, we maintain a queue ^ containing all deletable vertices; i.e. all vertices 
of degree less than k: 

SLOW-STRIP 
Input: A hypergraph G. 

Initialize: t := 0, Gq := G, is a list of all vertices of degree less than k in G, 
ordered uniformly at random. 

While ^ 0: 

Let V be the next vertex in 

Remove a hyperedge e selected uniformly at random from all those containing v. 

If V now has degree 0 then remove that vertex from G and from 

If any other vertex of e has its degree drop to below k then add that vertex to the end of 
Gt+i is the resulting hypergraph; t := t + 1. 


At any point, Kiay contain some vertices from Si and some from Aj+i. However, the 
vertices of Si are removed before the vertices of 5'j+i. Note also that when processing a 
vertex v ^ J2t, all edges from v are removed (and hence v is removed) before moving to 
the next vertex of So this procedure removes vertices in the same order as the parallel 
stripping process. In particular, if t is the step in SLOW-STRIP when the first vertex of Si 
is selected then Gt is the hypergraph remaining after the first i — 1 iterations of the parallel 
stripping process and at the beginning of step t, = Si. 

Definition 11. We use t{i) to denote the iteration of SLOW-STRIP at which iteration i of 
the parallel stripping process begins; i.e. when the first vertex of Si reaches the front of 

We also define: 


T is the iteration in which SLOW-STRIP halts. 

We will focus much of our analysis on the following parameters of Gt- 

• Lt is the total degree of the light vertices in G*; i.e. of the vertices in 

• Nt is the number of heavy vertices in Gt] i.e. of the vertices outside of .Sf 


• Dt is the total degree of the heavy vertices in Gt. 


We denote the triple of these values as: 




and 

J't = := {%, • • •) %}■ 

We first give an overview of the proof of Theorem 3. The key arguments are to rather 
precisely describe the evolution of {Lt)t>o, especially in a critical range of t. We will show 
that after a sufficiently large but bounded number B of iterations of the parallel stripping 
process, the number of vertices remaining in the hypergraph becomes very close to an (see 
Lemma 15). Then, we will prove that in SLOW-STRIP, Lt decreases with at least a certain 
rate for all t > t{B) (see Lemma 16). Using that we can bound from below the rate at which 
(or I S'* I, roughly speaking) decreases for each i > B. This allows us to obtain the upper 
bound for Sk{'Hr{n,cn)) as in Theorem 3. 

To obtain the lower bound of Sk{'Hr{n,cn)) in the supercritical case as in Theorem 3(a), 
we will tightly bound the rate (from both below and above) at which Lt decreases for t > t{B) 
(See Section 6). This enables us to establish a rather precise description of (|S'j|)i>B (see 
Lemma 47), and hence deduce the desired lower bound on the stripping number. 

However, we will first present a slightly weaker lower bound, i.e. without the logarithmic 
factor, with a much simpler proof in Section 4.3. We do so because this weaker bound is 
part of the proof of Theorem 3(b). The key idea is to focus on the steps of SLOW-STRIP 
during which the last vertices are removed before reaching the k-core, for some 

constant K > 0. We will show that with high probability Lt = in all these steps 

(see Lemma 19). We will then consider the iterations of the parallel stripping process during 
which the last vertices are removed. In each iteration, the total number of vertices 

being removed must be since their total degree is In order to remove 

vertices, at least iterations of parallel stripping are required. This yields 

the slightly weaker lower bound. This same proof, combined with a coupling argument, will 
then yield the lower bound claimed in Theorem 3(b) (see Sections 4.4 and 4.5). 

4.1 The allocation-partition model 

We use the following allocation-partition model (AP-model), denoted by APj.{n,m). It 
was used in [18] and is a slight modihcation of what is called the pairing-allocation model 
in [6]. We are given a set of rm vertex-copies. We represent each vertex u as a bin. We 
choose two random objects: (i) a uniformly random partition of the vertex-copies into parts 
of size exactly r; (ii) a uniform allocation of each vertex-copy into a bin. We call the output 
a configuration. 

Having chosen a configuration from APr{n,m), we can transform it into a hypergraph 
as follows: We contract the bins into vertices, and each part of the partition becomes a 
hyperedge. It is easy to show, with simple counting arguments, that all simple hypergraphs 
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with n vertices and m edges are generated with eqnal probability. {Simple means that no 
two hyperedges are identical, and no vertex appears twice in the same hyperedge). 

The AP-model differs from the conhguration model of Bollobas [4] in that the degree 
sequence is not fixed in advance, and the vertex-copies are not initially assigned to actual 
vertices. Note that the random partition and the random allocation are orthogonal and 
they can be chosen independently of each other. This will be very helpful below when we 
condition on events (specifically values of Lt, iV^, Dt) which specify partial information about 
both the partition and the allocation. 

'Hr{n,m) only selects simple hypergraphs. Of course, the hypergraph formed by the 
AP-model might not be simple, but when m = 0(n), the probability of obtaining a simple 
hypergraph is at least e for some absolute constant e > 0. This immediately yields: 

Corollary 12. If m = 0{n) and if property Q holds a.a.s. for APr{n,m), then Q holds 
a.a.s. for Hrin, m). 

4.2 SLOW-STRIP on the AP-model 

We will analyse the running of SLOW-STRIP on a random configuration generated by the 
AP-model, and then use Corollary 12 to translate the results to m). We continue 

to use hypergraph terminology to refer to the configuration, and so strictly speaking, a 
“vertex” is a bin, a “hyperedge” is an r-tuple of the partition along with an allocation of 
the vertex-copies in that r-tuple, Gt is the configuration remaining after t iterations, etc. 
We say a vertex-copy is light/heavy if the bin it is allocated to is light/heavy. Where we say 
“graph” in this context, we mean the (not neccessarily simple) hypergraph obtained from 
the configuration by contracting the bins. 

SLOW-STRIP runs on a configuration in APr{n,m) as follows: Initially, the queue 
contains all the light bins; i.e. the bins of size less than k. In each step, SLOW-STRIP 
removes a vertex-copy x of the bin u in the front of the queue, together with the r — 1 
vertex-copies in the same part as x. Each time we delete one of the Dt vertex-copies not in 
we query whether the bin containing that copy now has size /c — 1; if so then we move 
that bin to the queue. When all vertex-copies of u are removed then u is deleted from the 
queue. 

We will typically condition on the values of Pt = {{Ls, Ns, Ds)}s<t- The following obser¬ 
vation (first shown in [6]) enables our analysis. 

Observation 13. Upon conditioning on Pt, every allocation of the Dt heavy vertex-copies 
to the Nt heavy bins such that each bin has size at least k is egually likely. 

Proof We can use the method of deferred decisions to expose the configuration while 
running SLOW-STRIP on a configuration generated from the AP-model. Initially: for each 
bin u, we expose whether the size of u is less than k. If it is, then we expose all the vertex- 
copies assigned to u and place u into ,So- So for each of the Nq other bins, we have only 
exposed the fact that the size is at least k. The entire vertex-allocation of the configuration 
is completed by taking an allocation of the remaining Dq vertex-copies into those Nq bins, 
such that each bin receives at least k copies. Since each such {Dq —)■ A^"o)-allocation yields 
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a single allocation for the entire configuration, each such {Dq —)• iVo)-allocation is equally 
likely. 

In each step, when we remove a vertex-copy from J2t, we expose the r — 1 other vertex- 
copies in the same part. For each such copy w: if w has not already been allocated to a bin 
(i.e. if w ^ ^t) then we expose the bin that w is allocated to, delete tc, and then query 
whether the size of that bin has dropped below k. If it has, then we expose the k — 1 remaining 
vertex-copies in that bin and move it to Otherwise we expose nothing else about that 
bin; i.e. we have still only exposed that its size is at least k. The same reasoning as in the 
previous paragraph shows that every allocation of the remaining heavy vertex-copies to 
the Nt remaining heavy bins is equally likely. 

This proves that the lemma statement is true if we condition on all the information 
exposed during the first t iterations. Since the distribution of the allocation is a function 
only of Nf, D^, then the statement is true if we condition only on fFt. □ 

Since 'Hr{n, cn) is relatively hard to analyse directly; many of the existing proofs for the 
size of the fc-core of 'Hr{n,cn) use simpler models (e.g. [6] used the AP-model and [24] used 
the cloning model) for the analysis and then the results are translated to 'Hr{n,cn). Hence 
Lemma 7 and Corollary 9 actually hold for the AP-model ([6] did not obtain the sharp error 
as in Lemma 7 but it was claimed by the authors that with a bit more effort the full result 
as in Lemma 7 can be shown with their method using the AP-model). Thus we have the 
following lemma. 

Lemma 14. Both Lemma 7 and Corollary 9 hold for the AP-model. 

The reader may notice that our result does not actually require Lemma 14. By Lemma 7 
and Corollary 9, we may run SLOW-STRIP on a random hypergraph H G 'HriUiCn) until 
the remaining subgraph Gt has the desired order as required in Section 4.3. It is easy to 
see that by conditioning on Gt is uniformly distributed over all hypergraphs with graph 
parameters agreeing with %. Therefore, we could have applied the AP-model to Gt and 
started our analysis from there. However, it makes the proof a little easier to understand 
if we start with H G APr{n,cn) and apply Lemma 14, which guarantees that with a high 
probability, we will obtain some configuration Gt of the desired order. This is how Section 4.3 
will be presented. 

4.3 The supercritical case: c = Cr^k + 

We first consider the supercritical case in which we assume c = Cr^k + where 0 < 5 < 
1/2. These conditions on 6 are assumed in the rest of Section 4 as desired by the hypotheses 
in Theorem 3. Our goal is to prove that s{'Hr{n, cn)) is bounded above by 0{rAI‘^ logn) and 
below by Ll{n^l’^). Note that this lower bound is slightly weaker than that of Theorem 3(a), 
but the proof is much simpler. The tight bound of Theorem 3(a) will be proved in Section 6. 

Recall that we are running SLOW-STRIP on a configuration generated from the AP- 
model. 

The following lemma allows us to assume that the remaining subgraph is sufficiently close 
to the k-core. Roughly speaking, after a constant number of rounds of the parallel stripping 
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process, we can get within any linear distance of the fc-core. Recall that Hi is the subgraph 
remaining after i — 1 rounds of the parallel stripping process and Ck{H) is the k-core of H. 

Lemma 15. Let e, Cq > 0 he fixed. Assume c > . Then, for H G APr{n, cn), 

there exists a constant B = B{r,k,eo) > 0, such that a.a.s. \Hb \ Ck{H)\ < e^n. 

Proof. Let e' > 0 be a small constant to be determined later and let c' = c + e'. Choose H' 
according to the distribution of APr{n,c'n) and generate H by removing e'n edges chosen 
uniformly at random in H'. (he. we remove e'n uniform parts and the vertex-copies contained 
in those parts.) Then H O H' and H is distributed as APr{n,cn). Run the parallel k- 
stripping process on H', using H'i to denote the subgraph remaining after i — 1 iterations. 
Several other papers (see eg. Proposition 31 of [1]), show that for any a > 0 there exists 
a constant R > 0 such that [H'^ \ Ck{H')\ < an, if H' G 'Hr{n,c'n); the same conclusion 
translates to the AP-model with a similar argument. 

Note that every vertex removed during the hrst B — 1 iterations of the parallel stripping 
process applied to H' would also have been removed during the hrst B — 1 iterations of the 
parallel stripping process applied to H. Thus Hb P H'^. Also, Ck{H) C Ck{H') and, by 
Lemmas 7 and 14, \Ck{H') \ Ck{H)\ = {a{c') — a{c))n + o{n). Therefore: 

\HB\Ck{H)\ < \H'B\Ck{H')\ + \Ck{H')\Ck{H)\<an + {a{c!)-a{c))n + o{n) 

< 2(cr -|- a(c') — a{c))n < eon, 

where the last inequality holds as long as e', a are sufficiently small (both depending on 
eo). □ 

Lemma 15 says that for any eo > 0 there is a R = R(eo) such that after iteration R — 1 
of the parallel process, or equivalently, at the beginning of step t(R) of SLOW-STRIP, the 
size of the remaining hypergraph Gt(B) is at most eon greater than the size of the /c-core. 
This implies that various parameters are very close to those of the /c-core, and this is where 
our analysis starts: we focus on t > t{B). 

A key part of our analysis is to control the change in Lt. We will express the expected 
change of Lt at step t as a function of Lt, Nt and Dt. Recall: 

J't = {'Ts}s<t = {{Ls, Ns, Ds)}s<t 

and we will estimate E,{Lt+i \ R)). The following lemma bounds this expectation. 

Lemma 16. There are constants B,K such that: If c = Cr^k + n~^, then a.a.s. for every 
t{B) <t<T, 

E(Lt+i I Pt) <Lt- Kn-^^\ 

Remark. The key Lemma 32 of [1] implies that, when c = Cr^k + e, we have a.a.s. 
E(L 4 +i| Pt) < (1 — C)Lt for a constant C > 0. That difference is what causes the strip¬ 
ping number and depth to rise from O(logn) for c > Cr^k + e to n^P for c = Cr^k + R 
is also the cause of most of the difficulties in this paper. However, the weaker fact that 
E(Li+i — Li I Pt) remains bounded below —Kn~^^‘^, no matter how small Lt gets, is still very 
useful to our analysis. 
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Let a = a(r, k) be the constant given in (5) and Ki = iLi(r, k) be the constant specihed 
in Corollary 9. Then, by Corollary 9, for c = Cr^k + n~^^ a.a.s. the size of the fc-core is 
an + 

Lemma 16 will be used to bound the stripping number from above. To obtain a lower 
bound, we will focus on the so-called Phase 2 of SLOW-STRIP, dehned as follows: 

7 = 

Definition 17. to is the hrst step of SLOW-STRIP at the beginning of which the number 
of vertices in the remaining hypergraph is exactly an + 7 . We refer to steps t = 0,....,to~l 
as Phase 1 and the remaining steps as Phase 2. 

Note that we can obtain a lower bound on the number of vertices removed in Phase 2: 

Lemma 18. If c = Cr^k + then a.a.s. at least 7/8 = Kin^~^^'^ vertices are removed in 
Phase 2. 

Proof By Corollary 9(a), a.a.s. the /c-core has fewer than an P vertices 

and so SLOW-STRIP has to remove at least vertices to reach it. □ 

We will show that, throughout Phase 2, Lt is small. 

Lemma 19. If c = Cr^k + n~^, then a.a.s. for every to < t < r, = 0{n^~^). 

We defer the proofs of Lemmas 16 and 19 to Section 4.4. 

We now show how these lemmas nearly yield a proof of Theorem 3(a). It is easy to use 
Lemmas 18 and 19 to bound from below the total number of hyperedges that are removed 
during Phase 2. Lemma 19 bounds the total degree of each Si, which bounds from above 
the number of hyperedges that are deleted in each iteration. This implies that Phase 2 
requires many iterations; specihcally, it yields a lower bound of on the stripping 

number, which is within a factor of logn of the bound in Theorem 3(a). For the upper 
bound: Lemma 16 bounds the rate at which decreases and so implies that it drops to 
zero quickly; of course, when = 0 then the stripping process has ended. 

4.3.1 Theorem 3(a): proof of a weaker lower bound 


We begin with a lower bound of on the stripping number when c = Cr^k + 

for some 0 < 5 < i. This lower bound is slightly weaker than that of Theorem 3(a), but the 
proof is much simpler. Furthermore, this weaker lower bound will be used in Section 4.5 to 
prove Theorem 3(b). The proof for the tight lower bound of Theorem 3(a) will be presented 
in Section 6. 

We run SLOW-STRIP and recall that this can be viewed as also running the parallel 
stripping process slowly. At step to, an + vertices remain in the hypergraph. 

By Lemma 18, a.a.s. at least Kin^~^^'^ vertices are further removed before reaching the 
/c-core. All these vertices, except those whose degrees are below k at step to (and there 
are of them by Lemma 19), are incident with at least k edges. Thus, at least 

Kin^~^/‘^ — 0{n^~^) > {Ki/2)n^~^/‘^ hyperedges are removed from Gt during Phase 2. By 
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Lemma 19, at most \Lt(j) \ = 0{jn}~^) of them belong to Sj for each j. Therefore, we require at 
least iterations of the parallel stripping process to remove 

them all. 

This proves that a.a.s. the stripping number of APr{n, cn) is at least Corollary 12 

implies that the same is true of 'Hr{n,cn). □ 

4.3.2 Theorem 3(a): proof of the upper bound 


Now we turn to the upper bound on the stripping number when c = Cr^k + for some 

0 < (5 < i. 

We will focus on the change in the sum, over all v E Si of the degree of v at the 
beginning of iteration i of the parallel stripping process. 

From iterations t{i) to t{i + 1) — 1 of SLOW-STRIP, all hyperedges from all vertices in 
Si must be deleted. One hyperedge is removed in each iteration, and it touches at least one 
and at most r members of S'*. Thus 

+ 1) - f(i) < Lt(i). (7) 

Let B and K be constants specihed in Lemma 16. Let r* be the hrst step t such that 
t > and E(Li_|_i| Bt) > Lt — if there is no such step then we set r* = r. Then, 

for all t{B) < t < T*, 

E(Lt+i I Bt) <Lt- Kn-^/\ 

Taking conditional expectation on fFt(i) of both sides, for z > R, yields 

E(Li+i I < E(Li I for all t{i) <t <t*. 

If + 1) < r* then inductively applying the above for all t{i) < t < + 1) we get 


Dehne the random process (£j)i>o as follows. For all i such that t{i) < t*, let Ci = Lt(j) 
and for all i such that t{i) > t*, dehne Ci = Ti_i(l — {K/r)n~^^^). Let T denote the 
minimum integer such that Ct < 0; thus a.a.s. r = t(T). By Lemma 16 a.a.s. t* = r, and 
so a.a.s. Lt(i) = Ci for all 0 < i < T and Lt(T) = Ct = 0. Hence, we only need to obtain an 
upper bound for T. Now we have that for every i > B, 


E(A+i I Ci) < Ci 



Taking expectation on both sides we obtain that 


K 

ERj+i < E£j ( 1- n 


- 5/2 
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Hence, as Cb < n, for each i > B, 


< n f 1 - n 

r 


i-B 


Thus, for z > i? + ^n^/^logn, E(£j) = o(l) and so a.a.s. T < B + {2r/K)n^/‘^logn. This 
implies that a.a.s. the process of (£i)i>o reaches A < 0 within B + {2r/K)n ^logn iterations 
and so a.a.s. the parallel stripping process halts within B + (2r/\ogn iterations. 
Therefore a.a.s. the stripping number of APr{n, cn) is at most logn = logn); 

Corollary 12 implies that the same is true of Tirin, cn). □ 


4.4 Bounds on L^: proof of Lemmas 16 and 19 


In this section, we prove Lemmas 16 and 19. So throughout, we have c = Cr^k + for 
some 0 < h < |. 

Let H G APj.{n, cn) and run the SLOW-STRIP algorithm on H. Recall from Definition 17 
that 

7 = 3iCin^-^/^. 


and to is the first iteration of SLOW-STRIP in which the number of vertices in the remaining 
hypergraph is exactly [an -|- 7 J. The Second Phase of SLOW-STRIP consists of iterations 
t > to- Lemma 18 enables us to focus on the case when the algorithm does enter this phase. 


Definition 20. We say that Gt^ is nice if the number of light vertices in is at most 7 
and the number of hyperedges in Gt^ is f3n - 1 - 0 ( 7 ). 

Note that Corollary 9 implies that a.a.s. Gt^ is nice. 

Recall that for every t > 0, G* is the hypergraph remaining at the beginning of iteration t 
of SLOW-STRIP. The light vertices in Gt are defined to be the vertices with degree less than 
k and L* denotes the total degree of the light vertices in Gt- Recall that r is the iteration in 
which SLOW-STRIP halts. 

The following proposition follows immediately from Corollary 9 and Lemma 18. 
Proposition 21. A.a.s. to + 7/8 < r < to + k'y. 

Proof. By Lemma 18 and Corollary 9, a.a.s. the number of vertices removed in Phase 2 is 
between 7/8 and 7 . Since each step of SLOW-STRIP deletes at most one vertex, and each 
vertex entering the queue takes at most k — 1 < k steps to be removed by SLOW-STRIP, 
the number of iterations Phase 2 contains is a.a.s. between 7/3 and k'y. □ 


It will be convenient to dehne 


r = min{r, to + k'y}, 

and so Proposition 21 implies that 

a.a.s. T = T. ( 8 ) 

The following proposition follows from the definition of to and r. 
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Proposition 22. Assume r > to o,nd Gt^ is nice. Then for all < t <t, 

(a) the number of vertices in Gt is an + 0(7) and the number of hyperedges in Gt is 
/ 3 n + 0(7); 

(b) L, = 0(7). 

Proof. By definition, the number of vertices in Gt^ is cm+ 7 . Since Gt^ is nice, the number of 
hyperedges in Gt^ is ( 3 n + 0(7). Moreover, by the dehnition of r, r — to = 0(7). We remove 
one hyperedge in each step, so for all to < t < r, the numbers of vertices and hyperedges in 
Gt change by 0(7) from those in Gt^. This immediately conhrms part (a). Similarly, since 
GtQ is nice, Lt^ = 0 {pf). This variable changes by at most k in each step and so Lt = 0(7) 
for every to < t < r and this proves part (b). □ 


4.4.1 Creating light vertices 

The key to analyzing the evolution of Lt is determining the rate at which new vertices are 
added to i.e. become light. We begin by examining the distribution from Observation 13. 

Given positive integers TV, D and fc > 0 such that D > kN, dehne Multi{N, D, k), the 
truncated multinomial distribution, to be the probability space consisting of integer vectors 
X = (JVi,..., Xn) with domain ;= {d = (di, ..., djy) : = D, dt > k, Vi G [TV]}, 

such that for any d G X^, 


Pr(X = d) 


D\ 



nie[Af] 

SdGZj, riiGlV] 


where 


deXfe ie[Af] 


1 



The degree distribution of the heavy vertices of Gt, conditional on X}, is exactly Multi{Nt, Dt, k), 
by Observation 13. It was proved in [6, Lemma 1] (also appeared in [2, eq. (7)]) that the 
truncated multinomial variables can be well approximated by truncated Poisson random 
variables with expectation Dt/Nt. The result is stated as follows. Recall the dehnition of 
/fe(A) from (2). We dehne: 

gk{X) = A/,_i(A)//fc(A). (9) 


Note that gk{X) is the expectation of a Poisson random variable with parameter A truncated 
at k. 


Proposition 23. Let k > 0 be fixed, and TV and D satisfy D — kN = Q{N). Assume 
X Multi{N, D, k). For any j > k, let pj denote the proportion of X that eguals j. Then, 
with probability 1 — o(l/TV), 


where A satisfies gk{X) = D/N. 


( 10 ) 
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By Lemma 27 (below), gk{x) is an increasing function on x > 0. It is easy to show that 
hmj._j.o 5 'fc(x) = k. Hence, for any D > kN, there is a unique A that satishes Xfk-iW/fkW = 
D/N in the above proposition. 

It is easy to check from the dehnition of iir,k above (4) that 

gk{g-r,k) = ( 11 ) 

Dehne: 

C = Cr,fc = r^ja. (12) 

The following lemma justihes that C > k. The proof consists of some tedious calculus, so we 
defer it to the Appendix. 

Lemma 24. Suppose r, fc > 2 and (r, k) ^ (2, 2). Then, k < ( < r{k — 1). 

Thus, Proposition 23 and Corollary 9 imply that a.a.s. the proportion of degree k vertices 
in the k-core is approximately 


Pr,k = e 




fk (/^r,/c) 


(13) 


The following technical lemma has appeared in several other papers (e.g. in [1]), but as 
the proof is short we include it in the Appendix. 


Lemma 25. For every r,k >2 (r, k) ^ (2, 2), 


kpr^k ‘ ^ 1 

rfd (r — l){k — 1) 

And now we can prove the key lemma governing the evolution of Lt. Recall from above (8) 
that T = min{r, to + fcy}. 

Lemma 26. Assume that r > to and Gt^ is nice. For any to < t < r, iFt rnust be such 
that: for each of the r — 1 random vertex-copies chosen during iteration t, the probability, 
conditional on Tt, that the vertex-copy belongs to a heavy bin which becomes light in this 
iteration, is l/(r — l)(fc — 1) + 0 ( 7 /n). 


Proof. By Observation 13, we can treat the allocation of the Dt heavy vertex-copies to the 
Nt heavy bins as a uniform allocation subject to each bin receiving at least k copies. We hrst 
choose the r — 1 vertex-copies, then we choose the allocation. Let x be any one vertex-copy 
and let b be the bin to which x is allocated; we wish to bound the probability that |6| = k. 

Consider a different experiment: hrst conduct the allocation, and then choose x uniformly 
from amongst all vertex-copies, and let b be the bin to which x was allocated; clearly these 
two experiments are equivalent ways to choose b. Let A* be the unique root of gkW = F)t/Nt. 
By Proposition 23 the probability that we choose x from a bin of size k is: 


''“a"' 


-A* 


A* 


fk{\*)k\ 


Nt + 0{n ^/^logn). 
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(Note that the o(l/n) failure probability in Proposition 23 is absorbed into the log n) 

term.) 

Since Gtg is nice, Proposition 22 yields: 

Nt = an + 0{'y), = r/3n + 0 ( 7 ). (14) 


So 

9k{X*) = = ^ + 0 ( 7 /n) = gk{f^r,k) + 0{-f/n). 

an + (J['y) a 

Let 5 = g[{fir,k)', recall that g'l^ and pr.fc are independent of 7 ,n and so <5 is a constant 
independent of 7 ,n. Lemma 27 (below) implies that 5 > 0. Thus 5 '(,(A) > ^6 for all 
A = fJ^r,k + o{n); similarly, Lemma 27 implies that for any C > 0, 5 'fc(A) > 0 for all A > (n. 
These two bounds, along with (14) easily imply that 


A* = iJ^r,k + 0{'y/n). 


k 

Setting h{x) = recalling (13), and noting that h'{fir,k) = 0(1); and applying 

Lemma 25, we have 


q 


kNt 

Dt 


h{X*) + 0{n-^^Hogn) 


(r 


1 


Oh/n). 


ka 


Kg.r,k) + 0 ( 7 / 77 ,) 


kap'p^k 

rf3 


+ 0(7/n) 


□ 

We close this subsection with a technical lemma proving the monotonicity of gk{x). The 
proof will be given in the Appendix. 

Lemma 27. For any x > g'^{x) > Q. 

4.4.2 Proof of Lemma 19 


Recall that we are running SLOW-STRIP on the AP-model. Recall also from Section 4 
that Lt denotes the total degree of the vertices with degree less than fc in Gj, and recall from 
Dehnition 17 that to is the beginning of Phase 2 of SLOW-STRIP. Corollary 9 ensures that 
a.a.s. Gtg is nice and so Proposition 22 immediately yields a weaker version of Lemma 19: 
a.a.s. for all to < t <t, Lt = = 0 ( 7 ). This allows us to prove that a weaker form 

of Lemma 16 holds for t > to'- 

Lemma 28. Assume that t > to and Gtg is nice. For every to <t <t, 

nLt+i\J^t)=Lt±0{n-^/^). 
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Proof. Let v be the vertex taken from at iteration t. Let Ui,... ,Ur-i denote the other 


vertex-copies (except the copy in v) that are selected to be deleted in this step. The removal of 


the vertex-copy in v contributes —1 to ALt := always. We consider the contribution 

to ALt from the removal of Ui. Let v{ui) denote the bin that contains Ui. There are four 
cases. 

Case 1\ The removed edge contains two copies of the same vertex. The probability of 
this case is 0{l/n) and so the contribution of this case to K{Lt+i — Lt \ J^t) is 0(l/n). 

For the remaining three cases, we can assume that no other copy in v{ui) has already 
been removed during iteration t. 

Case 2\ v{ui) was in ..St in Gt- In this case, the contribution of the removal of ut to 
ALt is —1. Since Ui is chosen by the algorithm u.a.r. from all remaining vertex-copies, the 
probability of this event is 0{Lt/{Dt + Lt)) = 0{'y/n) by (14) and Proposition 22(b). 

Case 3: v(ui) enters St at iteration t. In this case, the size of v{ui) is k in Gt and 
the contribution to ALt from the removal of is /c — 1. The probability of this event is 
l/(r — l)(/c — 1) + 0 ( 7 / 77 ,) by Lemma 26. 

Case 4- v{ui) was not in St and does not enter St+i at iteration t + 1. In this case, the 
size of v{ui) is more than k in Gt and the contribution to ALt from the removal of Ut is 0. 
The probability of this event is 1 — l/(r — l){k — 1) + Of-^/n). 

By the linearity of expectation, summing the contributions of mi, ... we have 




1 


¥.{Lt+i-Lt\J^t) = -l + (r-l) (^(- 1 )- 0 ( 7 / 71 )+ (fc-l) 1^— 

+0{l/n) = 0(7/77). 

The lemma follows by noting that 7 = ©( 77 ^“'^'^^) by Dehnition 17. 


(r — l){k — 1) 


□ 


The following lemma is a simple application of the Hoeffding-Azuma Inequality [3, 20]. 

Lemma 29. Let and Cn > 0 be real numbers and (A'„,j)i>o be random variables with 
respect to a random process (G„_j)j>o such that for all t >0 



and \Xn,i+i — Xn,i\ < Cn, for every i > 0 and all (sufficiently large) n. Then, for any real 
number j > 0, 



Proof. Dehne = Xn,i — ian for all 7 > 0. Then, 

E(W ,2+1 I ) = E(w„ ,*+l I {On,s}s<i) (i -|- l)(7ri ^ Xn^i ia^i 

Thus, (W,i)i>o is a supermartingale. Moreover, |W,i+i — W,i| < + l^nl- By the Hoeffding- 

Azuma Inequality, 



This completes the proof of the lemma. 


□ 
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We now recall the statement of Lemma 19: 


Lemma 19. If c = Cr^k + n then a.a.s. for every to < t < t: 

u = 0{n}-^). 

Proof. Without loss of generality we may assume to < '>'■ By Corollary 9 we may also assume 
that Gto is nice. Recall that r = min{r, to + kj}. By Lemma 28, there exists a nonnegative 
sequence {an)n>i such that = 0{n~^/‘^) = 0{'y/n) and for every to <t <t, 

I Ji) < Li + On, E(—Li+i I Li) < —Li + On- 

Dehne Ct such that Ct = Lt for every t < r and Ct+i = Li for all t > r. Now for every 

t>to, 

E(Li+i I {Ls}s<t) < Li + ttn, E(—Li+i I {Ls}s<i) < —Li + a„. 

It is clear that |Li+i — Li| = 0(1) always as Li+i = Li + 0(1) always. By Lemma 29 with 
j = 7^0 log 77 , and c„ = 0 ( 1 ), for every to < t < to + k'y, 

Pr(Li > Lio+a„(f-fo)+7^'^^ log’^) = o{n~^), Pr(-Li > -Li(,+a„(f-fo)+7^'^^ log’^) = o{n~^). 

We apply the union bound over all to < t < to + fcy, along with the asymptotics a„ = 
0(7/77,), and 7^0 log 77 = 0 ( 72 / 77 ) (since 7 = 3iCi?7,^“^0 and § < 1 / 2 ), to obtain that a.a.s. 
for all to < t < to + k'-y: 

Ct > Li„-0(7V^) (15) 

Li < Ct, + 0{jyn) (16) 

By ( 8 ), a.a.s. r = t and so a.a.s. r = r < to + ^7 and = Lt- = = 0. Therefore, (15) 

with t = T yields 

Lto = Ct^ =0{-f^fn). (17) 

Substituting that into (16) yields that for all to < t < t, Lt = Ct = O( 7 ^/ 77 ) = 0(77^“*^), 
thus establishing the lemma. □ 

4.4.3 Ct and pt 


In this section, we study two parameters Ct and pt to be dehned below, which allow for 
a more careful analysis of the rate at which new light vertices are formed. 

Recall that Lt is the total degree of the light vertices in Gt (i.e. those of degree less than 
/c); eventually, our goal is to prove Lemma 16, i.e. to bound E(Li_|_i — Lt \ L)) from above. 
This quantity has been bounded from below in the previous section for t > to (Lemma 28). 
The main challenge for Lemma 16 is that we need to show a uniform bound for all t > t{B), 
as long as R is a sufficiently large constant (recall that to grows with 77). This also makes a 
signihcant difference from Lemma 28 which only requires t > to- 

Now let H G APr{n, cn), where c = Cr^k + and let R be a sufficiently large constant 
whose value is to be determined later. Recall that Gt is the hypergraph remaining after t 
iterations of SLOW-STRIP and so Gt(B) = Hb- 
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Recalling that Nt is the number of heavy vertices, and Dt is the total degree of the heavy 
vertices, we dehne: 

Ct = = Dt/Nt is the average degree of the heavy vertices in Gt- 

Recalling that, by Corollary 9, the fc-core a.a.s. has an + o{n) vertices and fin + oin) edges 
and C = rf}/a. Therefore the fc-core a.a.s. has average degree C + o(l) and so Ct approaches 
C. Note that C = 5'fc(hr,fc) by (11). 

We are interested in the probability that a particular heavy vertex-copy is allocated to a 
bin of size k since this tells the proportion of the heavy vertices in deleted hyperedges that 
become light. So, recalling Observation 13, we define: 

Definition 30. pt = ptiTt) is the probability that a given vertex-copy is assigned to a 
bin of size A: in a uniformly random allocation of Dt points to Nt bins subject to each bin 
receiving at least k points. 

We will deduce pt as a certain function of Ct, approximately. For all x > k, we dehne: 

X{x) is the root of fi'fc(A) 
pj^x) 

With some basic calculations we can show that pj{x) is strictly decreasing for x > k. 
Lemma 31. For all x > k, p>k{x) < 0. 

We defer the proof to the Appendix. 

In the k-core, a.a.s. the proportion of the total degree that comes from vertices of degree 
k is approximately pj{C) by Corollary 9 and Proposition 23. 

Since pt approaches '0(C), we should have pt ~ '0(Ct)- Our next lemma formalizes this 
approximation: 

Lemma 32. Assume Nt = VL{n) for every t < t. Then pt = {1 + \ogn))pj{pt) for 

all t < T. 

Proof. Let A be chosen such that A/fc_i(A) = Ct/fc(A)- Then by Proposition 23, the following 
holds for all f < r: 

= + by (18), □ 

We will analyze Ct,Pt in order to prove Lemma 16. 

Recall that gk{x) = xfk-i{x)/fk{x) as dehned in (9) and that C = r(3/a. Then, g,r,k is 
the root of gk{x) = C by (11). Recall also that 

\{xY~^ 

“ A_i(A(x))(fc-l)!’ 


= x; 


i.e. 


Xfk-iiX) = xfkiX) 


fk-i{X{x)){k - 1)! 


(18) 

(19) 
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where A(x) is the root of gk{^) = x. Now define p* = '0(C)- Then, we have 


P = 


D l^r,k ! M 1 


fk-l{Pr,k){k - 1 )!' 


By the dehnition of a, /3 below (4) and pr^k in (13), we have p* = kpr,k/C- Then, by 
Lemma 25, we have p* = l/(r — l){k — 1). As a snmmary of the above discussion, we have 
the following equalities. 


9k (^Pr,k') 


P 


* 


C = r^/a] 

= 7-TT77-TT- 


( 20 ) 

( 21 ) 


The expected change in L^, is closely tied to Ct- So in order to prove Lemma 16, we 
begin by bounding Q over the next two lemmas. Recall that C,t = Dt/Nt denotes the average 
degree of heavy vertices in Gt- We will apply Lemma 32 to relate pt with '0(Ct) for all t in 
the range we analyse and therefore we must restrict to a sequence of “typical” hypergraphs 
{Gt) such that Nt = Q{n) holds in the whole process. To formalise the idea, we define: 


Gt is normal if \pt — '0(C0| < n log^ n. 


Note that whether Gt is normal is determined by J^t so, equivalently, we could say J^t is 
normal. We define the stopping times: 


Ti is the minimum integer t such that Gt is not normal; (22) 

Ti = r if Gt is normal for all 0 < t < r. 

We will focus on steps t such that Ct is close to (. Hence, we will restrict to sequences {Gt) 
such that (t gets close to ( eventually. Given a constant e > 0, define: 

A is the minimum integer that ||l/(Gt)| — an\ < en and \Ct — Cl ^ e; (23) 
te = r if such an integer does not exist. 

Observation 33. For all te < t < A + ken, we have Ct = C + 0{e). 

Proof. Each iteration of SLOW-STRIP removes exactly one hyperedge. So it reduces each 
of Dt and Nt by at most r. Since Dt,^ > rfin and Nt^ > an, each step can only change 
Qt = Dt/Nt by at most 0(e), and this will be true for at least ken steps. □ 

Lemma 34. 

(a) There are a sufficiently small constant e = e{r, k) > 0 and two constants pi = pi{r, k) > 
0 and p 2 = P 2 (b k) > 0 such that such that for all t^ <t < min{A + ken, ti}, 

n n 
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(b) Given a constant e > 0, a.a.s. there is a large constant B = B{r,k,e) for which: if 
c = Cr^k + then t^ < t{B). Moreover, a.a.s. Ti = r and r < + ken. 

Proof. Let e > 0 be be a small constant whose value is to be determined later. We first 
prove part (a). We may assume that t^ < ti since otherwise there is nothing to prove. 

Consider any t > t,,, and let v be the vertex taken from during iteration t. Recall that 
when SLOW-STRIP runs on the AP-model, it removes one vertex-copy from v and another 
r — 1 vertex-copies ui,... ,Ur-i chosen uniformly at random from the remaining ones. We 
will split the single step into r — 1 substeps such that Ui is removed in step Tt^i 

for all 1 < i < r — 1 and let o = t. We consider the contribution of deleting Ui, denoted by 
C{ui), to IE(Ct+i — Ct\ If Wj+i is light then C{ui) = 0. If it is heavy (which occurs with 
probability at least 1/2 if e is sufficiently small) then (extending the dehnition of Dt,Nt,pt 
to Dxt i , , pTt,i in the obvious manner): 


C{u, 


D 


Tt,i 


w 


^ n - 'i -L 
-g-ptM + 


D 


Tt, 


^Tt,i - 1 


PTt,i 


TtA 


Nt, 




(24) 


In this case, dropping the subscript and substituting l/(iV — 1) = (l/iV)(l -|- 0(l/n^)) 

(as N = Q{n)), we have that uniformly for all 1 < i < r — 1, 


C{u.) = l-l^(D-l){l-p)+p{D-k){l + l/N + 0(n-^))-D'j 
= i( - 1-p(£> - 1)+p(£> +^ - «: + 0(n"‘))) 

= i(^-l+p(^^-(fc-l) + 0(n-')jj. (25) 

By the dehnition of t^ and applying Observation 33, we have (t = rfd/a + 0{e)= ( + 0(e) 
for every te < t < te -|- ken. Recall from (21) that 


P* = ^(0 


1 

(r- l)(fc- 1). 


By Lemma 24 ( > k. Since ( = (r,k is a constant. Lemma 31 implies that 'ip'iC) < 0 and 
ijj'lx) is uniformly bounded away from zero in a small neighbourhood of (. By the dehnition 
of Ti in (22), for all t^ < t < min{0 -|- ken, Ti} and 0 < i < r — 1, 


PTt,i = + o(l) = V'(C) + 0{e) =p* + 0(e). 

If we were to substitute p = p*, D = rfdn, N = an into the RHS of (25), and simplify it, we 
would obtain: 


C{ui) 


+ yVTj)pVT)(C-(*^-l) + 0(n-‘))). 


(26) 
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By Lemma 24, there is a a > 0 such that ( < r{k — 1) — a. Hence, 


< —cr/2(r — l){k — 1). 

Since in every step, the quantity of variables in (25) (e.g. p and D/N) differs from that 
in (26) by 0(e), by choosing e > 0 sufficiently small, there exists constants p'l, p '2 > 0 such 
that a.a.s. ^ ^ 

— — <C{ui) < —— for all 1 < i < r — 1, 
n n 

for all < t < min{te + ken^Ti}. Since there are r — 1 Wj’s, part (a) follows with pi = 
(r - l)p'i,p 2 = (r - 1 )V 2 - 

Next, we prove part (b). By Lemma 15, there is a constant B = B{r,k,e'), such that 
a.a.s. \V{Gt{B)) \Ca:| < By Corollary 9, a.a.s. the number of vertices and hyperedges in 
Gt(B) is an + 0{e'n) and r(3n + O(e'n) respectively. Therefore, \Ct{B) ~ Cl < for some 
constant A > 0. We choose e' sufficiently small (correspondingly B sufficiently large), so 
that e' < e and e' < e/A. Thus we have t{B) > t^. Hence, a.a.s. there exists large constant 
B = B{e) so that 0 < t{B). The fact that a.a.s. ri = r follows from the dehnition of ti 
in (22) and Lemmas 32 and 7.. 

The fact that a.a.s. r < t^+ken follows by the definition of in (23) and Corollary 9: a.a.s. 
there are at most en + o{n) vertices to be deleted from Gt^ until SLOW-STRIP terminates, 
and it takes at most k — 1 steps to remove each vertex in 

□ 

In the next lemma, we obtain a coarse bound on (t, which will suffice to prove Lemma 16. 
We will rehne this bound in the later part of this paper when we require a stronger form of 
Lemma 16 (see Lemma 50). 

Lemma 35. For any sujficiently small constant e > 0, there exists a sufficiently large 
constant B such that for every e' > 0, a.a.s. for all t > t{B): Ct ^ Ct — and 

\Ct-C\<e. 

Proof. Take a sufficiently small constant ei to satisfy Lemma 34(a); then a sufficiently large 
constant B = B{r,k,ei) as in Lemma 34(b). Then, a.a.s. 

< t{B) and ri = r < + kein. (27) 

Moreover, by Lemma 34(a), for all t^^ <t < t' := min{fej +kein, Ti}: Ct is a supermartingale 
and Ct+i — Ct is at most 0{l/n). We couple Ct with another process Zt as in the proof of 
Lemma 19: for all t^^ < t < r', let Zt = (p for all t > r', let Zf+i = Zf. Now, {Zt) is a 
supermartingale for t > t^.^ and Zt+i — Zt + 0(l/n). By Azuma’s inequality, we have that 
for all fej <t 2 <ti< fej + kein and for any j > 0, 

Pr(Z,. - Z,. > V < exp (-n 
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Taking j = and taking the union bound over all pairs < ti < ^2 < + kein, the 

probability that there is a pair ^2 < in the above range such that Zt^ — Zt^ > is 

at most n^exp (—hi )) = o(l). Hence, a.a.s. Zt^ > Zt^ — for any pair H < 

h < tei + kein. 

By (27), a.a.s. t{B) > and t' = ti = t and so a.a.s. Zt = (t for all < t < r. So, 
a.a.s. C,t> Ct — for all t > t{B). 

To prove that a.a.s. |Ct ~ Cl ^ we simply choose ei to be sufficiently small in terms of 
e (and the implicit constant in Observation 33). (27) yields that a.a.s. for all t{B) < t < t 
we have <t < + keiu, so Observation 33 yields 

|Ct-CI<0(6i)<6. □ 


This immediately yields the following corollary. 


Corollary 36. For any sufficiently small constant e > 0, there exist constants B,K > 0 
such that a.a.s. for all t > t{B): 

(a) Ct>C + Kn-^'^; 

(b) \Ct - Cl < e- 

Proof. Choose B and e to satisfy Lemma 35. Immediately, we have a.a.s. |Ct ~ Cl ^ ^ for all 
t > t{B). 

Recall the definition of gk{x) in (9). By Lemma 7 and since /ir,fc > 0, a.a.s. 


c. 


r(3{c) + 0(n^/‘^) 
a{c) + 0{n^B) 


9Mc)) + 0{n 




By Lemma 27, g'k{h‘r,k) > 0. By Lemma 8, /i(c) —jJir,k = Kin ‘^/^ + o(n for some constant 

Ki > 0. Recall also that C, = rf^ j a = gk{g'r,k)- So a.a.s. 

C, = C + 0(n-^/2), (28) 


as n is absorbed by o{n ^/^) since 5 < 1/2. Now applying Lemma 35 with some \ 
so that = 0 ( 77 ,“*^/^), we have that a.a.s. 

for all t > t{B), Ct > Cr — > C + Kn~^^‘^, 


for some appropriate constant A' > 0. 


□ 


4.4.4 Proof of Lemma 16 


Recall that we analyze the running of SLOW-STRIP on APr{n, cn). In each step t{B) < 
t < T, the algorithm removes a vertex-copy in a light vertex and another r — 1 vertex-copies 
Ml,..., Ur-i chosen uniformly from all remaining ones. For each 1 < i < r — 1, let ht^i denote 
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the probability that Ui is light. Recall that pt is the probability that a heavy vertex-copy is 
allocated to a bin of size k. Then 

r—1 

— Lt \ J^t) = ~1 + ^ ^ f ^ + (1 ~ ht^i){k — l)pt + 0{n (29) 

i=l 

where 0{n~^) accounts for the change of pt caused by the removal of the hrst i — 1 points 
and the possibility that we select two copies of the same vertex. (29) is maximized when 
ht^i = 0 for all 1 < i < r — 1. Thus, 

E{Lt+i - Lt I J^t) < -1 (r - l)(/c - l)pt -f 0{n~^). (30) 

As in the proof of Lemma 34, we need a relation between pt and (t and thus it is 
convenient to restrict to steps t < ti where Ti is defined in (22). Moreover, we want to 
restrict to sequences {Gt) such that Corollary 36(a,b) hold. To formalise the idea, we first 
choose constants B, e and K to satisfy Corollary 36 (Note that e can be chosen arbitrarily 
small which results in larger B). Then, we dehne: 

T2 is the minimum t > t{B) such that (t < C + or — (C| > e; (31) 

r 2 = r if no such integer exists. 

Define 

r* = min{ri,r 2 }. (32) 

By Lemma 34(b) and Corollary 36, a.a.s. r* = r. 

By the dehnition of ri in (22) and noting that t* < ri, we have for all t{B) < t < t*: 

Pt = pj{Ct) + log^ n = ip{C) + pj'iOiCt - C) + 0{{Ct - Cf) + log^ n, (33) 

by expanding "^(Ct) at (. 

By Lemmas 31 and 24, and since ( = ({r, k) is a constant, we have pt'iC) < for some 
constant C > 0. By the definition of T2 and r*, for all t{B) <t<T*,(t — C can be assumed 
sufficiently small (by choosing sufficiently small e) so that 

- C) + o((Ci - O') < -^iCt - 0- 

By (21), 7p{() = Corollary 36, there is a constant Ki > 0 such that a.a.s. 

for all t{B) < t < r*, 0 > C + Kirr^/‘^. Putting all this together yields: 


Pt < 

< 


1 

(r — l){k — 1) 
1 

(r-l)(/c-l) 


— ^Kin n log^ n 

- K2n-^/\ 


for any constant K 2 < iLiC/2, as 5/2 < 

It follows then from (30) that there is a iC > 0 such that for all t{B) < t < r*, 

E(L,+i - L, I J-,) < -Kn-‘'\ 


(34) 


(35) 


By (33), this holds a.a.s. for all t{B) < t < t*, and by (32) a.a.s. t = t*] this proves the 
lemma. □ 
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4.5 Proof of Theorem 3(b) 

In this section, we prove Theorem 3(b). So |c — Cr^k\ < for some 0 < 5 < |. Onr goal 
is to show that a.a.s. the stripping number oi H E 'Hr{n,cn) is Similarly to the 

supercritical case, we will analyse the process on H E APr{n,cn) instead. 

We will define 5' ~ 5 so that c is not very far from c' = Cr^k + and then argue that the 
stripping number of H is not much smaller than the stripping number of H' = APr{n, c'n). 
We have already proven that the latter stripping number is VL{n ^'in Section 4.3.1. 

We specify a small e > 0 and define 5' such that: 

J n~^ if c = Cr^k — where S < 1/2 — e 
^ ii \c — Cr^k\ < . 

Remark So if |c — Cr^k\ < then 5' is a function of n, rather than a constant. We can 

still apply the results of Section 4.3.1 to H' as the proofs work even for non-constant 5, see 
the remark following the statement of Theorem 3. 

It follows then that in both cases, 

< n~^ — (c — Cr^k) < ‘in~^ . (36) 

To prove Theorem 3 for these ranges of c, it suffices to show that a.a.s. s{T-ir{n, cn)) = Vt{rA' 
with 5' dehned above, since n^' 

Let H' = APr{n, c'n) where c' = Cr^k + , and we generate H by removing (c' — c)n 

edges (i.e. r-tuples), chosen uniformly at random in H'. This way we couple H ^ H' and H 
is distributed as APr{n,cn). Consider the following stripping procedure to hnd the fc-core 
of R; 

1. Run Phase 1 of SLOW-STRIP on H', thus obtaining G'^^ C H'. 

2. For each vertex v removed from PI' in Step 1, we also remove v from H. We call the 
remaining hypergraph Gto- 

3. Run SLOW-STRIP on Gt,. 

To be clear: 7 and to are defined for the process of running SLOW-STRIP on H', not for 
H. So: 

7 = 

and to is the hrst step at which exactly an + 7 vertices remain in H'. 

Note that since H C H', the fc-core of H is contained in the k-core of H', and so the 
k-core of H is contained in G'^^. Gt^ contains every vertex of G'^^ and contains every edge of 
G'^^ that is an edge of H. So the fc-core of H is contained in Gto- Thus, this is a valid way 
to obtain the fc-core of H. 

Note also that this is not equivalent to running SLOW-STRIP on H, since doing so 
could remove the vertices in a different order. Nevertheless, we still have Giq C H and so 
the stripping number of H is at least the stripping number of Gt^- 
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The number of hyperedges in H' but not in H is (c' — c)n, which is by (36). We 

use Gi to denote the subgraph of H remaining after i —to iterations of SLOW-STRIP on Gt^. 
We define Li,Di,Ni as we did in Sections 4.4.1 and 4.4.2. Similarly we dehne 
to be the values of the same parameters for G[^. 

G[^ is the result of carrying out Phase 1 of SLOW-STRIP on H' = APr{n,c'n). So a.a.s. 
satisfy the bounds in (14) and (17). Clearly, \Lt^ - L'J, |Ao - D'J, iWo - 
are bounded by r times the number of edges in E{G^^)\E{GtQ). Note that \E{G^^)\E{Gtf^)\ 
is at most the number of edges in E{H')\E{H), which is ©(n^”^') = 0 ( 7 ). Combining this 
with the bounds (14) and (17) on L'^^, we have: 

^to = + 0 ( 7 ) = an + 0 ( 7 ) 

Dto = + 0 ( 7 ) = r^n + 0 (j) 

K = L(^+0(7Vn) = 0(7Vn) 

Next we prove that a.a.s. SLOW-STRIP applied to Gtf, lasts for at least 7/4 steps. 
Applying Proposition 21 to H' tells us that applying SLOW-STRIP to G[^ would take at 
least 7/3 iterations. Each iteration removes one hyperedge, and the removed hyperedge is 
not in the k-core of G[^ and hence not in the fc-core of Gto- We have shown that at most 
©(n^“'^') = 0 ( 7 ) of those hyperedges are not in Gt^- Therefore, SLOW-STRIP takes at 
least 7/3 — 0 ( 7 ) > 7/4 iterations on Gt^ (c.f. Lemma 18). It is then convenient to define 
r' = min{fo + 7 / 4 :, r} and we have just shown that a.a.s. r > r'. 

Corollary 9 implies that a.a.s. G[^ is nice. Since Gt^ contains all but 0 ( 7 ) hyperedges 
of a.a.s. Gt^ is also nice. Thus, the analysis of Section 4.4.2 applies to the running of 
SLOW-STRIP on Gt^- The analysis is similar except that 5 is replaced by 5', and yields the 
conclusion of Lemma 28; i.e. for every <t < t', 

E(Lt+i| J-*)=Li± 0 (n-^'/ 2 ). 

The same analysis as in the proof of Lemma 19 yields that (16) holds; i.e. a.a.s. for every 
to < t < r', 

L,<L,, + 0{jyn) = 0{n^-^'). (37) 

The rest of the proof follows as in Section 4.3.1. We argued above that SLOW-STRIP 
takes at least 7/3 — 0 ( 7 ) > 7/4 iterations on Gto (thus, a.a.s. t' = to + 7 / 4 ). It takes at 

most k — 1 iterations to remove a vertex, and so the parallel stripping process, applied to Gt^ 

removes at least 7 /( 4 (/c — 1)) vertices. By (37) each iteration i removes \Lt(^i)\ = ) 

vertices. So there must be at least 7 / 0 (n^“^) = (recalling that 7 = 0(n^“^/^)) 

iterations of the parallel stripping process, he. the stripping number of Gto least 

and hence the stripping number of H is also Vt{n^' 

This proves that the lemma holds for APr{n, cn). Corollary 12 implies that it also holds 
for 77 r(’ 2 , cn). □ 


5 Bounding the maximum depth: proof of Theorem 5 


We first note that the stripping number provides a lower bound on the maximum depth 
over all non-fc-core vertices. 

Lemma 37. For any vertex v G Si, the depth of v is at least i. 

Proof We prove by induction that every vertex v G Lij>iSj has depth at least i. 
This is trivial for i = 1. Suppose it is true for i, and consider v G Uj>i+iSj. Since v ^ Si, v 
has at least k neighbours which are either in the k-coie or in Uj>iSj. At least one of those 
neighbours must be removed before v can be removed, and by our induction hypothesis, each 
such neighbour has depth at least i. So any stripping sequence which removes v must hrst 
include a sequence of length at least i which removes a neighbour of v. Thus it must have 
length at least i + 1; i.e. the depth of v is at least i + 1. □ 

Therefore, Theorem 3(a) provides the lower bound of Theorem 5. We will focus on the 
upper bound. 

Recall that Si is the set of vertices removed during iteration i of the parallel stripping 
process, and Hi is the subhypergraph remaining after Si,Si-i are removed. 

Dehne: 

7max is the number of iterations carried out by the parallel stripping process. 

We dehne the following hypergraph formed by the vertices of Sp. 

Definition 38. The vertices of Si are Si. For any hyperedge / in Hi that includes at least 
one vertex of Si, /' = / fl Aj is a hyperedge of Si. If |/'| = a then f is said to be an a-edge. 

Note that Si may contain hyperedges of size one (in fact, for large i, most of the edges 
will have size one). 

We wish to bound the depth of a non-/c-core vertex v. We begin by dehning a set R{v) 
that contains such a stripping sequence. 

Definition 39. For each 1 < i < /max and any v E Si, we set /?' = R'i{v) := {u} and for 
each i = i to 1: 

(a) we set Rj = Rjiy) to be the union of the vertex sets of all components of Sj that 
contain vertices of /?'. 

(b) we set /?)_i to be the set of all vertices v G Sj-i that are adjacent to U^^^Ri. 

We dehne R{v) = Uj^iRi. 

Remark. For the purposes of this paper, we could have omitted step (a) and replaced R' 
by R elsewhere. The set R{v) would still have contained a stripping sequence leading to the 
removal of v. We dehne, and bound, this larger set for our application to clusters in random 
XOR-SAT in another paper[16]. (A preliminary version is in [17]). 

Observation 40. R{v) contains a stripping seguence ending with v. 
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Proof We prove this for each v E Si using a simple induction on i. For i = 1, v 
is a stripping sequence of length 1. For i > 2, note that R{v) contains every neighbour 
of V lying in levels — 1; call these neighbours Ui,...,Ug. The recursive construction 

ensures that R{v) also contains R{ui),R{uq) and so, by induction, contains stripping 
sequences ending with ui, ...,Uq. After the deletion of ui, ...,Uq, the degree of v drops below 
r (since v E St), and so adding v to the concatenation of those stripping sequences produces 
a stripping sequence ending in n. □ 

Theorem 41. Let r,k >2, {r,k) ^ (2,2) he fixed. There is a constant k = K{r,k) such 
that: for anyO < 6 < 1/2, if c = Cr^k + n~^, then a.a.s. for every v E Rr{n,cn): |-R(n)| < . 

We will prove the upper bound in Theorem 41 by showing that a.a.s. |i?(n)| = for 
every non-/c-core vertex v. Here and in the following sections, the implicit constant involved 
in 0{.) is always independent of 6. Theorem 5 follows immediately from Theorem 41 and 
Observation 40. 

We close this section with an overview of the proof for Theorem 41. One natural approach 
to bound R{v) is to: (a) carry out the stripping process until v is removed in some iteration 
i, then (b) explore R{v) starting with Ri{v) and working through Ri^fv), Ri- 2 {v), ...Rfv) 
according to Dehnition 39. However, after exposing all the edges deleted in iterations 1, ...,i 
in part (a), we have no randomness left to facilitate the analysis of part (b). 

To overcome this difficulty, we only expose a minimal amount of information in part (a). 
We expose the vertices of each S'*; some information about the deleted edges, such as the 
number of vertices an edge has in Si and in Fj+i; and some degree information. But crucially 
we do not expose the actual vertices in each deleted edge. When we carry out part (b), we 
expose the vertices of the edges relevant to R{v). This is enough randomness for us to bound 
|i?(n)|. 

To carry out part (a), we complete the entire stripping process using a procedure called 
EXPOSURE. The parameters we expose in this phase are listed at the beginning of the 
next subsection. Lemma 47 bounds these parameters. We then expose the vertices in all 
deleted edges using a procedure called EDGE-SELECTION. We use this procedure to expose 
the vertices in Ri{v),Ri-i{v), ...Rfv) and obtain a recursive bound on |i?j(n)| in terms of 
Rj+i{v), ...,Ri{v); see Section 5.2 and its proof in Section 5.4. 

5.1 Vertex-exposure and edge-selection 

As in Section 4, we will analyze the running of SLOW-STRIP on the AP-model. We 
will work with the AP-model for the remainder of Section 5. Again, when we use a graph 
theoretic term, we mean the obvious analogue for a conhguration. Recall: 

Imax is the number of iterations carried out by the parallel stripping process. 

We parameterize the hyperedges removed during iteration i as follows: 

Definition 42. A hyperedge removed during iteration i is called an (a,&)-edge, where a 
is the number of vertex-copies it contains from Si and b is the number of vertex-copies it 
contains from Si+i. Thus a > 1 and 0 < 5 < r — a and ii i = /max then we must have b = 0. 
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For every i,l < a < r,0 < b < r — a we define: 

is the number of (a,6)-edges in Hi. 

Definition 43. For each v G S'*: 

• d~^{v) is the degree of v in Hf, i.e. the number of vertex-copies of v that are removed 
during iteration i. 

• (if i < /max), d~{v) is the total degree of v amongst the hyperedges in i/j-i that contain 
at least one vertex of Fj-i; i.e. the number of vertex-copies of v in hyperedges that are 
removed during iteration i — 1. 

As described above, we expose the hyperedges removed during the parallel stripping 
process in two phases. First, we expose the vertices that are removed in each iteration, along 
with some degree and edge-count information; 

EXPOSURE: 

1. Expose /max- 

2. Expose the vertices in Si, 

3. For each 0 < i < /max and each vertex v E Si, u E Fj+i, expose d'^iv), d~{u). 

4. For each 0 < i < /max, l<a<r, 0</<r — a, expose M“’^. 

Of course, this also exposes the vertices and the number of edges in the Ucore Ck = Ck{H). 

To clarify what is exposed in terms of the AP-model: We have exposed the bins (vertices) 
in Si,..., For each bin v E Si, we have exposed the number of copies of v that are 

removed in iteration i — 1 and in iteration i respectively. In fact, for convenience we expose 
those actual copies. Other vertex-copies may still be allocated to bin v, but any such vertex- 
copy must be deleted during iterations 1, For each relevant i, a, b, we have exposed the 

number of r-tuples removed in iteration i that contain a vertex-copies from Si and b vertex- 
copies from Si+i, but for each such r-tuple, we do not expose the actual bins (vertices) those 
vertex-copies lie in, nor do we expose the remaining r — a — b vertex-copies in the r-tuple. 

Next we expose the actual hyperedges (i.e. r-tuples of vertex-copies which are allocated 
to bins) that are removed during each iteration. We dehne: 

Si is the set of hyperedges removed during iteration i. 

Si must satisfy the following conditions for each 1 < i < Imax- 

(PI) Each V E Si has exactly d'^{v) vertex-copies in Si. 

(P2) Each u E Si+i has exactly d~{u) vertex-copies in Si. 

(P3) Si contains exactly (a, 6)-edges for each a, b. 

(P4) In each of those [a, /)-edges, the r — a — b vertex-copies not in Si U S'j+i must be 
allocated to a bin in Ck Uj=i +2 
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Lemma 44. Conditional on the outcome of EXPOSURE, any set of hyperedges (r-tuples) 
satisfying properties (P1-P3) is egually likely to he £i. 

Proof Fix all parameters that are exposed in EXPOSURE (i.e. /max, Si, 
d'^{v), etc). Consider two sets of hyperedges (i.e. r-tuples of vertex-copies, and an allocation 
of those vertex-copies into bins) Ei, E2 satisfying properties (P1-P4). We need to show that 
El, E 2 are equally likely to be £i. 

Recall that a conhguration is a partition of the rm vertex-copies into r-tuples, and an 
allocation of the rm vertex-copies into the n bins. Let Hi be a conhguration that is consistent 
with the outcome of EXPOSURE (i.e. applying the parallel stripping process to Hi will result 
in the various parameters being equal to what was exposed in EXPOSURE), and such that 
applying the parallel stripping process to Hi will result in Si = Ei. Let H 2 be the hypergraph 
obtained from Hi by replacing Ei with E2. 

We claim that: H2 is also consistent with the outcome of EXPOSURE, and that applying 
the parallel stripping process to H 2 will result in St = E 2 . To verify this claim, we only need 
to show that, when we apply the stripping process to H2, we take /max iterations, and for 
each 1 < j < /max, the set of vertices removed in iteration j is Sj (the fact that E 2 satishes 
(P1-P4) conhrms the remainder of the claim.) This follows easily from the fact that Ei, E2 
do not include any hyperedges that contain vertices from Si,...,Si-i and that Ei,E 2 each 
consist of all the hyperedges that contain vertices from Si in what remains after removing 
Si,..., Si_i from Hi,H 2 , resp. 

So we have a bijection between the conhgurations which yield Si = Ei and Si = E 2 . 
Furthermore, every conhguration is equally likely to be chosen as APr{n,cn). Thus Ei and 
E 2 are equally likely to be Si, which establishes the lemma. □ 

So our goal is to choose a uniform set of r-tuples satisfying (P1-P4), for each 1 < i < /max- 

EDGE-SELECTION: For each 1 <i < /max, we expose Si, the hyperedges deleted in 
iteration r. We have already exposed M“’^, the number of (a, /)-edges. Each such edge will 
be assigned a vertex-copies from Si, b vertex-copies from Rj+i, and r — a — b vertex-copies 
from Cfc Uj=i +2 assign those copies as follows: 

1. Each vertex v E Si has d~^{v) copies. Noting that J2veSi ■: 

a uniformly random partition of all vertex-copies in Si so that each (a, fe)-edge receives 
a part of size a. 

2. Each vertex u G Si+i has d~{u) copies. Noting that J2ueSi 1 d~{v) = we 

take a uniformly random partition of all vertex-copies in Rj+i so that each (a, /)-edge 
receives a part of size b. 

3. For each (a, /)-edge, we choose the remaining r — a — b vertex-copies uniformly from 
those that have not yet been allocated to bins. Then we allocate each of these vertex- 
copies to a bin selected uniformly from Ck u j=i+2 

Any two sets of hyperedges (i.e. r-tuples of vertex-copies which are allocated to bins) 
satisfying (P1-P4) are equally likely to be chosen by EDGE-SELECTION. Therefore, if we 
carry out EDGE-SELECTION, then by Lemma 44, we have chosen the hyperedges from the 
correct distribution. 
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5.2 A recursive bound 

Consider some vertex v E Si and recall Definition 39 where we define R{v) = Ri{v)URi-i{v)U 
... U Ri{v). That definition naturally lends itself to a recursive bound of |i?j(n)| in terms 
of Rj-i{v),..., Ri{v). We present that definition in this subsection, prove it in Section 5.4, 
and then use it to bound |i?(n)| in Section 5.5. It will be convenient to restrict our analysis 
to deletion rounds which are late enough that certain bounds on various parameters hold. 
So we let B = B{r, k) be a sufficiently large constant to be named later (it will come from 
Lemma 47 below) and set 

= R{v) \ (Ui<,<B^*) (38) 

Lemma 49 bounds |7?(n)| = 0(|i?^^^(n)| Tlogn) and so bounding R^^\v) will suffice to prove 
Theorem 41. 

The expected size of Rj-i depends not just on the size of Rj, but also on the d~ values of 
the vertices in Rj. So we will recursively bound the sum of those values, rather than bound 
the (nearly equal) \Rj\. 

Definition 45. For X ^ Si, we define 

uex 

Note that if f > 1 then d~{u) > 1 for all u E Si as otherwise, u would have been deleted 
in iteration i — 1, and so D~{Rj{u)) > \Rj{u)\ for all j < i. Therefore, bounding D~{Rj{u)) 
will suffice to bound \Rj{u)\. 

Given our non-/c-core vertex v, we define: 

ly is the integer i such that v E Sf, i.e. the iteration during which v is removed. 

As described above, we are restricting our analysis to vertices deleted after iteration B, so 
we will assume that R > B. We will bound D~{R^^\v)). The recursion starts with a base 
case bound on D~{Rj^{v)), and then bounds D~{Ri^_i{v)), D~{Ri^_ 2 {v)),...,D~{Rb{v)). 
Thus, we express D~{Rj{v)) in terms of D~{RRv)) for i > j, rather than the usual i < j. 

Lemma 46. There are constants B = B{r,k), Z = Z{r,k) >0 such that with probability at 
least 1 — n~^: for all B < j < R with |S'jj > log^ n, we have 

I C I t + ^ 

D~{Rj{v)) <D-{Rj+i{v)) + Zh^^D"{R,{v)) + \og^^n. (39) 

^ (. = Iv 

We will analyze the recursive equation in Lemma 46 to obtain a bound on D~{Rj) for 
each j, in Section 5.5. 

We prove (39) by analyzing the exposure process from Section 5.1. We begin by analyzing 
EXPOSURE: we bound /max and prove that the parameters d'^{v), d~{v), satisfy certain 
properties (Sections 5.3 and 6). Next, we analyze EDGE-SELECTION one iteration at 
a time, with i decreasing from /max- When analyzing Rj to prove (39), iterations i = 
/max . • • •. ? + 1 have already been completed. So for v E Sj^, we have already exposed 
Rivy Riv-ly •••) ^j+1 ■ 
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5.3 Properties of Si 


We first run EXPOSURE. We prove that a.a.s. the sets Si satisfy certain properties. 

Lemma 47. There exist constants B,Yi,Y 2 , Zi, dependent only on r,k, such that a.a.s. 
for every B < i < /max with IRjl > \og^ n: 

(a) if |S'i| < then (1 - < \Si+i\ < (1 - Y 2 n~^^‘^)\Si\; 

(b) tf\S,\>n^-^ then (1-W^)|^,| < < (i _ 

(c) < Zi\Si\n^/^. 

(d) l-Sil < Enes, d~{u) < l^il + + log^n; 

(e) + log^n; 

(f) Ea>2,b<r-a + log^ 

(a) 'Eu&Si d^{u)d-{u) < J2ueSi d^{u) + Zi^ + log^ n; 

(h) EuesSd-iu)^ < Z,\S,\; 

(i) d~{u) < logn for all u G 

The proof is deferred until Section 6. 

5.4 Proving the recursive bound 

Here, we will prove the recursive bound from Section 5.2. Recall that we specihed a 
vertex v, and ly < /max is the iteration during which v was deleted. 

Our goal is to prove Lemma 46, and so we can assume j > B and \Sj\ > n^log^n. For 
each j, we will prove the probability that (39) fails is less than n~^, and so a union-bound 
implies that the probability it holds for every j is at least 1 — Ivn~^ > 1 — n~^. 

The setting is that we have carried out EXPOSURE, and we assume that all properties in 
Lemma 47 hold. We have also carried out EDGE-SELECTION for iterations /max, •••, J + 1, 
and in particular, we have exposed /?/„,..., Rj+i. 

The random experiment in this section is iteration j of EDGE-SELECTION. At this 
point, we know that there are ii/“’^ (a, 6)-edges for each a>l,0<6<r — a — 6, but we 
don’t know which vertices are in these edges. Each (a, /)-edge consists of r blanks which 
will be hlled with vertex-copies: the a level-j blanks will receive vertex-copies from Sj] the b 
level-{j + 1) blanks will receive vertex-copies from S'j+i; the remaining r — a — b blanks will 
receive vertex-copies which will then be allocated to bins in Ck U[=j +2 

We hll in these blanks with three independent steps, in reverse order of how they are 
listed above. 
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Step 1: For each (a,6)-edge, we choose the r — a — b vertex-copies and then 
allocate them to Ck Ul=j +2 

In order to bound D~{Rj), we will first bound 


A : the total number of level-j blanks, amongst all hyperedges 

deleted in iteration j that contain a vertex-copy from (40) 

Note that A is an upper bound on 

We let Ai denote the number of these level-j blanks whose hyperedge contains a vertex 
from ui^j^Ri- Note that Ai is determined by Step 1: After allocating these vertex-copies to 
Ck U [=^+2 know exactly which (a, fe)-edges contain vertex-copies that are allocated to 

Each time we allocate one of these copies, the probability that we allocate it to a bin 
from uil'f Ri is at most 

t—lv ^ 


ufit Hi 


\Ct U£ 


< 


max Q 

i=j+2 


a{c)n + o{n) ^ 


i+2 


i—Iv 


where a{c) is from Lemma 7. 


There are at most {k — l)|S'j| such hyperedges, and for each one we choose at most r — 1 
vertices. So the expected number of hyperedges which have at least one vertex-copy allocated 
to ui^j^Ri is at most II 1-^*1- Standard concentration bounds on binomial 

variables imply that the probability this number exceeds + log^n is less 

than n~^. Each such hyperedge contains fewer than r vertex-copies from Sj. So with 
probability at least 1 — n~^, we have 

2l I Q I t + ^ 

Ai < ^ |i?*|-Frlog^n. (41) 

ale) n 

t = lv 


Step 2: For each (a, 6)-hyperedge, we choose the b vertex-copies from Vi. 

We let A 2 = A — Ai- So A 2 is the number of level-j blanks whose hyperedge contains a 
vertex-copy from Rj+i and was not counted in Ai. Note that A 2 is determined by Steps 1 
and 2. 

It will be easier to focus on A 2 > A 2 which is dehned to be the number of pairs of vertex- 
copies V G Sj,u G Rj+i that both he in an (a, 6)-edge for some a,b. We only require an 
upper bound on A 2 , so it will suffice to bound A^. 

Each (a, 6)-edge contributes a to A 2 for each of its b copies from Sj+i that he in Rj+i. 
The total number of copies in Rj+i is D~{Rj^i) so 


E(A*) = 


< 

< 

< 


J^abM; 


a,b D (Rj+l) 


E. 


a,b ^u&Sj+i 

D~{Rj^l) 


d (u) 


\S: 


j+l\ 


abMf'" since d (m) > 1 for all u G Sj^i 


a,b 




n 


by Lemma 47(e,a,b) 


1^: 


D (Rj^i) + 2 Z 1 ——D (Rj^i) + 2^1 log n, 


(42) 
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since, by Lemma 47(g), D (Rj+i) < D {Sj+i) < {v)f < Zi\Sj+i\. 

To bound A 2 , we will prove that it is concentrated. Note that A 2 > D~{Rj^i) since every 
vertex-copy of Rj+i lies in an (a, 6)-edge for some a > 1, and hence contributes at least one 
of the pairs counted by A^. This allows us to focus instead on proving the concentration of 

X = a;-d-{r,+,). 

X is typically much smaller that A^. This will be an advantage when we use E(X) rather 
than E(A 2 ) in our calculations in (43) below. Note that D~{Rj_^.i) is fixed. 

To prove concentration of JA, we apply McDiarmid’s variation on Talagrand’s Inequality[37]. 
We use the version stated in [33]: 

McDiarmid’s Inequality [29] Let X be a non-negative random variable determined by in¬ 
dependent trials Ti,...,Tm and independent permutations ni,...,!!^'. We call the outcome 
of one trial R, or the mapping of a single element in a permutation flj, a choice. Suppose 
that for every set of possible outcomes of the trials and permutations, we have: 

(i) changing the outcome of any one trial can affect X by at most g; 

(a) interchanging two elements in any one permutation can affect X by at most g; and 

(Hi) for each s > 0, if X > s then there is a set of at most qs choices whose outcomes 
certify that X > s. 

Then for any t > 0, we have 

Pr(|X - E(Jf)| > i + + 128A) < 4exp {- + t) 

Our random choice in Step 2 is an assignment of the vertex-copies of Sj+i to the (a, b)- 
edges. This can be done by taking a uniform permutation of those vertex-copies: There 
is an implicit listing of all (a, 6)-edges over all a, b: For each edge, we know the values of 
a, b corresponding to that edge, but we don’t yet know exactly which a copies from Sj and 
b copies from Sj^i are assigned to each edge; the permutation of Step 2 determines the 
assignment of the copies of Fj+i. Thus the choice corresponding to a particular vertex-copy 
specihes the hyperedge to which it is assigned. We will apply McDiarmid’s Inequality using 
this permutation. 

If we exchange the position of two vertex-copies in the permutation, we are swapping the 
hyperedges to which they were assigned. This can change A 2 , and hence JA, by at most r — 1 
(the extreme case is when the swap involves an (r — 1, l)-edge). So we take g = r — 1. 

To certify JA > s, we can always present £* < s vertex-copies ui, such that each U£ 

is assigned to an {ai,bi)-edge with at >2 and this can be certihed 

by the outcomes oi t < s choices, and we can take g = 1. 

Setting t = max{E(JA), log^ n}, it is easy to see that 25g\/K{X) -\- 128g‘^ < t and E(X) -|- 
t < 2t. So McDiarmid’s Inequality yields: 

Pr(X > E(X) + 2t) < 4exp < 4exp (-^]) = (43) 
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Therefore, applying (42), with probability at least 1 — n we have 

A 2 < a; < D-{R,+i) + + 3Zi log' n. (44) 

Step 3: For each (a, 6)-hyperedge, we choose the a vertex-copies from Sj. 

In most of the analysis for this step, the probabilities are in terms only of the random 
choices in Step 3; it is assumed that the choices from Steps 1 and 2 are fixed. We will use 
the notation Prs and E 3 to indicate this. 

Recall from Dehnition 38 that Sj is the hypergraph formed by treating each (a, 6 )-edge 
as a hyperedge on a vertices of Sj] we call this an a-edge. At this point, each a-edge consists 
of a blanks, and we have a pool containing dj{u) vertex-copies of each u E Sj. The random 
choice in Step 3 is a uniformly random bijection from these vertex-copies to these blanks. 

We dehne 

$ : the hyperedges of Sj that belong to (a, 6 )-edges containing vertex-copies from Ri] 

(45) 

thus R'j is the set of vertices of <h. Note that $ was determined by Steps 1 , 2 , and that 
A = Ai -|- A 2 is the total number of blanks contained in the hyperedges of $. 

For each hyperedge / G Sj, Cf denotes the component of Sj that contains /. Thus Rj 
is the union over all / G $ of the vertices in Cf, and D~{Rj) < X]/e 4 > |A*~(C'/)| (there is an 
inequality here as some pairs /, /' G $ may lie in the same component). In order to bound 
K{D~{Rj)), we will bound 'Es{D~{Cf)) for each /. 

Consider any particular hyperedge f E Sj. We will expose Cf using a branching process 
as in [32]: 

Initially, every blank of / is labelled open. 

While there are open blanks: 

Take any open blank, and choose a uniform vertex-copy for that blank; 
the blank is labelled closed. 

Assign every other copy of the same vertex to a blank chosen uniformly 
from those not yet assigned vertex-copies. 

For each blank g that is chosen: 

if it is open (i.e. already known to be in Cf), we label it closed] 
else, every other blank in the same hyperedge as g is labelled open. 

Intuition: By Lemma 47(e,f), the vast majority of blanks lie in 1-edges. So when we 
assign copies to blanks, we usually do not create any new open blanks. Thus, the process 
tends to die out very quickly and Cf typically has few (in fact, no) vertices outside of /. 

To analyze this branching process, we consider the following experiment. Select a se¬ 
quence of vertices ui, ...,Ui^^ 2 ^, each chosen from Sj without replacement, where the proba¬ 
bility that w E Sj\{ui, ...,Ui-i} is chosen to be Ui is proportional to dj{w). 

To run the branching process: each time that we need to select a new vertex-copy x 
for an open blank, we simply take the next vertex on the list ui,... and choose one of its 
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vertex-copies and let x be that vertex-copy. Note that these vertices are chosen with the 
correct distribution. 

We will prove that there is a constant W\ > 0 such that for any y > 0 and every 
1 < t < log^ n: 




IS, 


(46) 


for each a-edge /: Prs |C/| > t 


d (ui) = 1/ < 


i=l 


W,\Sj 


n 


r_ tzS: _1 

I (r-l)(fc-l) I 


(47) 


Proof of (46): From Lemma 47(g) and the fact that J2ueSj initially the 

expected value of d~{ui) is: 

Zlnes,-, ry \Sj\ , log^n 
< 1 + Zi-h 


6j{u) 


n 


l^.l ■ 


After removing at most log^n of the vertices of Sj, the expected value of d (ui) rises to at 
most 


1 Zi 




n 


+ 


log^n \ 1^,1 

\Sj\ ) I'S'il - log^n 


< 1 + 2Z] 


1^. 


n 


+ 2 


log^ n 


since \Sj\ > n^log^n. 

Proof of (4'^)- Conditioning on the event = U) exposes information about 

d~{ui), ...,d~{ut). This affects our conditional probability only through its effect on the 
conditional distribution of dj{ui), ...,dj{ut), as EXPOSURE determined {dj{u),d~{u)) for 
each u G Sj. So we can deal with this conditioning by proving that for any choice of 
5i,..., (5t: 


PrsdC^I > t I dj{ui) 


(5i, ...,dj{ut) 




(48) 


Suppose that we selected vertex ui. We place one of its vertex-copies into the open blank 
that we are filling. In order to create at least one new open blank, we must assign at least 
one of the other dj{ue) — 1 copies to a blank in an a-edge for some a > 2; we call such an 
edge a plural edge. 

The total number of blanks in Sj is at least times the number of vertex-copies 
in S'j+i (the extreme case is when every hyperedge is a (!,?" — l)-edge) and thus is at least 
^ by Lemma 47(a,b). If we have assigned the copies of fewer than log^ n vertices, 

then the number of unhlled blanks is at least ~ (^ ~ 1) log n > Lemma 47(f) 

says that the total number of blanks in all plural edges is at most Zi\Sj\'^/n + log^ n. So the 
probability that at least one of those dj{ui) — 1 < k — 2 copies of ug, is assigned to a plural 
edge at most 

^ ^ V ^ / / 2r n 
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If at least one copy of ui is assigned to a plural edge, then we create at most {k — 2)(r — 1) 
new open blanks. For an a-edge /, if \Cf\ > t, then we must have created at least t — a 
open blanks (in addition to the a initial open blanks in /) during the exposure of the hrst t 
vertices ui, ... ,Ut. So at least r (r.-i)(fc-i) 1 vertices ui, ...,Ut must have a copy assigned 

to a plural edge. The probability of this occurring is at most: 


r__1 




for an appropriate constant Wi, thus establishing (47). 

Having proven (46) and (47), these imply that there is a constant W such that 

for each a-edge /: E 3 {D~{Cf)) < a (l + ^ 


n 


\S, 


Proof First note that (47) implies: 


r__1 

I (fc-l)(r-l) I 


(49) 


Pr(|C/| > log^n) < 


Wr\S, 


n 


log^ n/{i —l)(fc—1) 


= o (n-“) . 


(50) 


Next, note that D {Cf) < rcn, the number of vertex-copies in the entire conhguration. 
So 


E3(C-(C,)) = J2vPr3(D-{C,)=y) 


y=l 


log^ n 


i=l 


< 5^2/ I PrsdC'/l > log^n) + ^ Pra I (|C/| > t) A («*)=!/ 

y=l y t=a 

< 0(n^)Pr3(|C/| > log^n) 

rcn log^ n / t 


y=l t=a 

log^ n 


.i=l 


lFi|S'd\ 


n 


by (47) 


< o{n ) + ^ y —~— J I 


log^ n 


< oin' 


■■)+E 


t=a 


hFilS'jiy 


n 




Ml + 2Z,lhl + 2!^ 

n |6i| 


n 


< o{n ^) -I- ya + 2k^r 

Tiz 1‘^il c: log^^ 

< Oi P WQi --|- 5a—I I—, 

n IS'jI 


1 + 2Z,A1 + 2!5^ 

n |5i| 


by (46) 


(51) 


for an appropriate constant IF; this is (49). 
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Our goal is to bound D~{Rj) < order to obtain (39). As in Step 2, we 

will instead focus on a related quantity: We use |/|* to denote the number of vertices in /; 
i.e. if / contains multiple copies of a vertex u then u is counted only once in |/|*. We dehne 

A = ^min{A)"(0/),log^n} - |/|*. 

/€$ 

Note that > |/|* as every vertex u e / satishes ^“(m) > 1. Also, log^n > r > |/|* 

and so X is non-negative. 

We will show that, with very high probability, D~{Cf) < log^u for every /, which allows 
us to work with X. The advantages of doing so are twofold: (i) subtracting |/|* from each 
term in the summand has the same advantage as switching to X in the analysis of Step 2; 
(ii) by introducing an upper bound of log^ n on the contribution of each term, we bound the 
effect of each random choice on X. Since Xl/g# I/I — (49) implies 

+ + (52) 

To prove concentration of X, we again use McDiarmid’s Inequality. 

Our random choice in Step 3 is an assignment of the vertex-copies of Sj to the blanks 
in the a-edges. As in the analysis of Step 2 , we do this by taking a uniform permutation of 
those vertex-copies. 

We hrst bound the amount by which exchanging two vertex-copies can affect X. Suppose 
that we replace a copy of u in an edge / with a copy of w in an edge /'. We will show that 
this cannot increase the contribution of the edges in Cf to X by more than 2{k — 1) log^n. 

Let C be the component containing / after the copy of u is replaced by a copy oj of 
w. Consider removing uj from /; so the size of / has been reduced by one. Let C by the 
hypergraph obtained from C by the removal of u. Let Ci be the component of C containing 
/ (if / had size 1 and hence is now empty, then Ci = 0) and let C 2 be the union of the other 
components of C. Note that before the copy of u was replaced with oa: Ci was a subgraph 
of Cf and C 2 was a subgraph of the component containing w. 

Case 1: |Ci|, IC 2 I > log^n. Then for every hyperedge /' G Ci, min(|C/-/|, log^n) = log^n 
both before and after the switch. The same is true of every hyperedge in 6 * 2 . So the 
contribution of those edges to X was unchanged by the switch. 

Case 2: |Ci| > log^u, IC 2 I < log^n. The contribution of the edges of C\ was unchanged 
by the switch, but the contribution of each edge in C 2 may have increased by up to log^ n. 
There are at most {k — 1 )|C' 2 | vertex-copies amongst those edges and so there are at most 
{k — 1 )|C' 2 | < {k — 1) log^n such edges. So the total contribution from those edges increases 
by less than (/c — 1 ) log® n. 

In each of the remaining two cases, similar reasoning shows that the total contribution 
of the edges in Cf increases by at most 2{k — 1) log®n. The same argument shows that the 
total contribution of the edges in Cp, the component containing f (the other hyperedge 
involved in the switch) is at most 2{k — 1) log®n. Thus no switch can increase X by more 
than 4(fc — 1) log®n. By considering the inversion of a switch, no switch can decrease X by 
more than 4(fc — 1 ) log® n. So we can take g = Ak log® n. 
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We next show that the event X > s can be certihed by revealing the outcomes of at most 
2rs choices. If X > s then there must be a subset of the edges of <h, say /i,.with 
> \ fi\* + Si ,> \ f£*\* + S£*, and Si + ... + = s with each S£ > 1. For 

each 1 < i < i*, we will certify that D~{CfJ > l/^l* + S£ by revealing at most 2rs£ choices 
as follows. We discuss two cases. 

Case 1: |C/J < \fe\* + S£. In this case, we reveal a spanning tree of Cf^. That is, we 
reveal the choice of vertex-copy assigned to each hyperedge of a spanning tree of Cf^. This 
spanning tree only needs to consist of fi, and at most S£ additional edges, and so the total 
number of vertex-copies whose assignments are revealed is at most r(l-l-S£) < 2rs£, as S£ > 1. 

Case 2: |C/J > \f£\* + S£. In this case, we will reveal a connected subgraph of Cf^ 
containing and at least S£ additional vertices. As described in Case 1, we can do so using 
at most S£ + l edges and hence by revealing the assignments of at most r{l + S£) < 2rs£ vertex- 
copies. Since each of these vertices has d~ > 1, this will certify that D~{Cf^) > l/^l* -f S£. 

This shows that we may certify X > s by exposing at most 2rs choices so we can take 
q = 2r. Setting t = max{E(X),6 x 2048/c^r log^^n}, it is easy to see that 25g^^/qE{X) + 
128 X g^q < t and E(X) + t < 2t. So applying McDiarmid’s Inequality with g = 4/clog®n 
and q = 2r yields: 

Pr(X > E(A) -I- 2t) < 4 exp ( ---^< 4 exp ( -^ 

^ ^ ^ ^ “ V 1024Prlog^2n(E(X)+t)y V 2048A:V log^^ n / 

< 4exp(—6 logn) = o(n“^). (53) 

To use our bound on X to obtain a bound on D~{Rj), we hrst note that by (50), with 
probability at least 1 — n~^: every Cf has size at most log^n and hence by Lemma 47(i), 
D-{Cf) < log^n for all /. This, (52) and (53) yield that with probability at least 1 — n 


D-(Rj) < x+5^iri<x+5^i/i 

/e-i- /G# 

< X-h A < E3(A)-F 2t-F A 

< 3 E 3 (X) + 12 X 2048fcV log^^ n + A 

< A + 3IF^ + 12 X 2048A:Vlog^3n 

V ^ \Sj\ ) 

< A 1^1 + 3IF^^ + 13 X 2048A:V log^^ n. 


since A < {k — l)|S'j|, hxand so Alog^?7,/|S'j| = o(log^^n). This, with (41) and (44), yields 
that with probability at least 1 — 3n~^, we have: 
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D-{R,) < + + 




i+2 \ 

J2\R^\ + Z[ log^n 

i=Iv ) 


X 1 + 3W 


l^.l 


n 


+ 13 X 2048/c^r log^^ n 


< D-iR,^,{v)) + V D~iR,{v)) + logi^n, 

n ^ 


i+i 


i=U 


for suitable constants Z[ and Z. This is (39). 

So the probability that (39) holds for a given j is at least 1 — > 1 — n~^. Taking 

the union bound for B < j < R yields Lemma 46. □ 


5.5 Solving the recurrence 


Because Lemma 46 holds only for IS),! > n^log^n and for j > B, we dehne: 

io = min{i : |S'i+i| < log^ n}; 

i.e. io is the largest index so that Lemma 46 applies to Sb, ■■■, *S'jg. Note that Lemma 47(a,b) 
implies 

IS'iol < 2n^ log^ n. (54) 

Recall that 

R(^)(n) = R(n)\(ui<,<B^*). 

We will prove: 

Lemma 48. There is a constant X = X{r, k) > 0 such that for any 1 < i < /max o-nd any 

V e Si.- 

Pr(\R^^\v)\ >n^'^)<4- 

Proof. Taking B and Z as specihed in Lemma 46, and taking the union bound over all 
vertices v, we have that with probability at least 1 — n“^: (39) holds for all v and all 
B <j < min{/„,io}- 

We hx a particular i > B and v E Si. As in the previous section, we say Ri instead of 
Ri{v). Restating (39) and replacing j with j' (because of an index change below) yields: 


D {Rj') R D (^Rjij^i) + Z 


\S: 


I 


+ log 


14 


n. 


for all B < j' < min{i, zq}. (55) 


This recursive equation bounds D~{Rji) in terms of D~{Rg) for values of i that are larger 
than j'. In order to analyze this using a recursive equation where values are bounded by 
values with smaller indices, we will perform the change of indices: 


J ■= i-j'- 
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We will recursively define such that: 


Tj > D {Ri-j) (56) 

First, we set up our base cases. Because (55) only holds for j' < min{i,io}, i-e. for 
j > max{0, i — iojj we need a base case for all smaller values of j. We set 

jo = max{0,i - io} 


and define: 


For 0 < j < jo : 

We verify that something even stronger than (56) holds for this base case. Applying 
Lemma 47(i), we have D~{Ri_j) < D~{Si_j) < lognjAi-jl. Lemma 47(c) and (54) yield 

i—io i—iQ 

'^D~{Ri_j) < logn^ IFi-jl ^logn^lF^I < logn-ZilFiJn^/^ = 0(72^*^/^ log^ n). (57) 

j=0 j=0 i>io 

This immediately yields (56) for all j < jo- 

We define the following recursion. 


1^.-7 


i-1 


Vjo + 1 < j < * - -B : = rj_i + Z—^ + 


n 


25 


(58) 


«=J0 


We argue inductively that (56) holds for all jo < j < i — B. We have already seen that 
it holds for the base case j = jo. For higher values of j, (55) yields: 




i-j+l 


D (Ri-j) < D (i?j_(j_i)) + Z 

7 /, 

i=i 
jo-l 


n 


D~{Re) + \og^^ 


n 


< rj_i + Z—D {Ri-e) + log^'^ n by Lemma 47(d) and induction on (56) 

£=jO £=0 


< 


Tj-i + Z —_|_ 72,25 noting that log^^ n is absorbed by 


i=jo 


= rj. 


Thus Tj bounds \Ri-j\: 

\Ri-j\ < D~{Ri_j) < Tj, V 0 < j < i — i?, 

It will be convenient to dehne: 

j 

(■=jo 
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Therefore, by (57), 

i—B i—B i—B jo — ^ 

\R^^\v)\ = < t^-B + n^'. (59) 

j=0 j=0 j=jo j=0 

To prove Theorem 41, it is sufficient to show that ti_B = . 

Since vj = tj — tj-i, rearranging (58), we have 

15'.1 

tj — tj-i = tj-i — tj-2 + Z — , Vj > Jo + 1) 

where and = 0. We solve this recurrence. It will be helpful to find sequences 

(®i)i>io ^^cl (^j)j>jo+i that satisfy 

tj - ajtj_i = hj{tj_i - aj-itj- 2 ) + Vj > jo + 1. (60) 

Rearranging gives 

~ tj-i = {ttj — 1 + bj)tj-i + {—aj-ibj)tj-2 + n , 
so we require that for all j > jo + 1, 

aj-l + bj = 1 + 

n 

1 , 

and we may set initial condition = &jo+i = 1. 

So we define aj, bj recursively as: 

bj = 1./cLj_iy Qjj = 2 —- T Z —■— 

ttj-i n 

and (60) holds. Since aj + > 2 for each j and aj^ = 1, it follows that aj > 1 for every 

j ^ jo- Next, we show that there is a constant D = D{r, k) > 0 such that for every j > jo + 1, 

1 +< a. < 1 + T)a/^3. (61) 

n V n 

The lower bound follows immediately from aj-i > 1 and the recursion aj = 2 — 1/aj-i + 
Z\Si-j\/n. We prove the upper bound by induction. By taking D > Z we ensure that (61) 
holds for j = jo + 1. Now assume that j > jo + 2 and that (61) holds for j — 1, and so: 

aj_i V n 

Since aj = 2 + Z\Si-j\/n — 1/aj-i, we have 

a. < 1 + + dJ^-^2z1±A, 

n \ n 
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By the definition of jo and ig, we have log^ n for all j > jo- So by Lemma 47(b), 

we have that for all jg < j < i — B, |S'j_j+i| < (1 — Y 2 \/\Si^j\/n)\Si_j\ for constant 
Y 2 = l 2 (r, k) > 0. Thus, 


n 



< i + z^+fl 

n \ 




= l + D 




n \ n 


where the last inequality holds by choosing D > 2Z/Y2- Thus, (61) holds also for j and thus 
it holds for every jg < j < i — B. 

Now we continue to solve the recurrence (58). Let Cj = tj — ajtj^i. Then, since bj = 
1 /oj-i < 1 for every j > jo + 1, we have 

Cj = bjCj-i + < Cj-i + < Cj^ + (j - jo)n^'^ < + (j - jg)n^^. 

Since j < i = 0 {n^B logn) by Theorem 3(b), this yields 

tj — ttjtj-i < logn) < U := for all j > jg + 1. (62) 

Let £g be the maximum integer for which |S'£g| > n^~^. Again applying Theorem 3(b), 

we have ig = 0(77,*^/^ logn). Note that, due to the monotonicity of jS'^l by Lemma 47(a,b), 

£g < ig trivially by the definition of £g and ig and by the fact that 0 < <5 < 1/2. 

Now is a good time to recall that our goal is to show tis = 

(62) says tj < ajtj-i + U. Applying this recursively yields that for every 1 < £ < i — B, 

i-l / i-2 h2 \ 

ti-B < ti-B-e O^i-B-h + f7 I 1 + tti-B-h j • (63) 

h=0 V /i2=0/i=0 / 


Since a/i > 1 for each h, we have 1 + Ilhlo — ^Y\!h=o^i-B-h- First assume 

that 7 > .^0 + 2. Taking £ = £g — B + 2 m (63) and noting that ai_ 4 _i <U£ghy (61) yields 


0—B+l 


Iq-B 


,-B 


ti-B < ^i-7o-2 I Bi-B-j \ YU£o Bi-B-j Y (1 + ti_£^_2)U£g (64) 

j=0 J j=0 j=0 


Again applying (61), we have: 


t-o-B 

n - 

3=0 


io-B 


< exp ( D ^ 

j=0 


IVbI 

n 


4 

= exp I T) ^ 
j=B 


1^7 


n 


(65) 


Next, we bound exp ( D Y1%b V ~ ) • ^7 Lemma 47(b), we have for all j < £g, 


IFj+il < exp 



IF, 
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and so 


|*5£o+i| < exp 



1^5 


By the definition of Eq and Lemma 47(a), we have | 5 ' 4 +i| > so 


and so 



= 0(n‘), 


since both D and Y 2 are positive constants. This, (64), (65), io = logn) and U = 

yield 

U-B = n^^^hi_eo-2- ( 66 ) 

It is easy to see that if i < £0 + 2, then the above argument already proves that U-b = 
by taking i = i — B in (58). So we assume that i > io + 2. It only remains to show 
that ti_ 4_2 = The same recursion that produced (63) yields that for every 1 < i < 

i-io- 2 - jo, 


e-i 


£-2 h2 




1-2 < 




>-2-l 


(^i-lo-2-h + f/ I 1 + ai-i^-2-h 

/i2=0 h=0 


/i=0 


Arguing as for (64), this time taking i = i — £0 — 2 — jo yields (noting that always) 


i—£o—3—jo i—£o—3—jo 

£i—£Q—2 — (^jo U'i') ££i—io—2—j ^ ^i—£Q—2—j'i 

j=0 j=0 

and arguing as for (65) yields 

i-£o-3-jo / i—3—jo 

P[ ai-to- 2 -,i < exp I D ^ 

i=o V j-eo+2 



(67) 


( 68 ) 


By the definition of £0 we have |S' 4 + 2 | < 'n} ^. By our definition of jo, for every £0 + 2 < 
j < i — 1 — jo, we have \Sj\ > log^ n, and so we can apply Lemma 47(a) to show 

\S,\ < (1 - < (1 - y2n-^/2)i-(£o+2)^i-5^ V 4 + 2 < j < ^ - 1 - jo, 


which implies that 

/ I Q I 

= 0 ( 1 ). 

i=4+2 ^ i>o 

This proves that ^*- 4-2 = by (67) and ( 68 ). So by ( 66 ) it completes our proof that 
ti-B = ■ Since |i7*''®^(n)| < ti-B + by (59), this proves the lemma. □ 
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5.6 Bounding the maximum depth 


Lemma 48 bounds for any non-/c-core vertex v in APr{n,m). Corollary 12 

implies that the same bound holds for 'Hr{n,m). 

The following lemma allows us to bound |-R(n)| using the bound on To present 

the lemma, we dehne N^{A) to be the set of vertices with distance at most s from A. The 
following easily proved lemma is from [1, Lemma 34], 

Lemma 49. Assume s,c> 0 are 0(1). A.a.s. for every subset of vertices A in 'Hj.{n,cn) 
such that A induces a connected subgraph, |iV®(A)| = Od^l + logn). 

Applying Lemma 49 with A = yields our bound on the maximum depth as 

follows: 

Proof of Theorem fl: By Lemma 48, there is a constant X > 0 such that in APr{n,m) 
a.a.s. there are no vertices v with . Any vertices in R{v) that are not contained 

in R^^\v), must have been removed during the hrst B rounds of the parallel stripping 
process, and must be within distance B of some member of R{v). This bounds each Rb{v) 
in APj.{n, m); Corollary 12 shows that the same bound holds for '^r)- Now we can apply 
Lemma 49, to show that a.a.s. for each v G Rr{n,m), |i?(n)| < Y{\R^^\v) \ + logn) < n^^^, 
for some constant Y = Y [B, r,k). □ 

6 Proofs of Lemma 47 and Theorem 3(a): the tight 
lower bound 


In this section, we will prove the key Lemma 47. Throughout this section, we have 
c = Cr^k + for some 0 < 5 < |. We have already seen that Lemma 47 is needed to 
complete the proof of Theorem 5. Lemma 47(a,b) will also immediately imply the lower 
bound in Theorem 3(a): 

Proof of Theorem 3(a): The upper bound was proved in Section 4.3.2, so we only need 
to establish the lower bound: that a.a.s. the stripping number is f2(n‘^/^ logn). 

By Lemma 47(a,b), there must be an iteration zq in the parallel process such that < 

lAjpl < . Taking the constant Yi from Lemma 47 and noting that 25 < 1, we let 

a > 0 be a constant satisfying 1 — 2Yia > 25. Applying Lemma 47(a) recursively for all 
*0 < * < ^ = nn'^/^ logn, we have 

|^*o+£| > (1 - Y^n-^/y\S,,\ > exp(-2W£n-^/2) ■ ^n'"^ = 

This is valid since l-Sjo+d > > n'^log^n for all 0 < j < .^ (which is desired in order 

to apply Lemma 47) by our choice of a. This shows that the number of iterations in the 
parallel stripping process applied to APr{n,m) is at least an'^/^logn; Corollary 12 implies 
that the same bound holds for Rr{n,m). This conhrms the lower bound in Theorem 3(a). 

□ 
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Most of the work in our proof of Lemma 47 goes towards proving parts (a,b). This 
requires a very tight analysis of the evolution of Lt, the total degree of the light vertices (i.e. 
vertices of degree less than k) in the hypergraph remaining after t iterations of SLOW- 
STRIP. Much of this work can be viewed as a strengthening of the analysis from Section 4. 
A key result from that section is Lemma 16 which bounds E(Li+i — Lt \ J^t) < 

This sufficed to prove the upper bound for Theorem 3(b), but to prove the lower bound, 
we need to strenghten Lemma 16 in two ways: (i) when Lt > the bound on the 

expected change is decreased to and (ii) we obtain matching lower bounds on 

the expected change. This is Corollary 54. 

Of course, the expected change in Li depends on the rate at which new light vertices 
are created. In Section 4.4.3 we studied this rate by examining two key closely related 
parameters: roughly speaking, C,t is the average degree of the heavy vertices in Gt (i.e. 
vertices of degree at least k) and pt is the probability that a uniform vertex-copy from the 
heavy vertices is a copy of a degree k vertex. We obtained a coarse bound on Q (Lemma 35, 
Corollary 36) which was sufficient to obtain Lemma 16. For our tighter analysis in this 
section, we will require a much tighter bound on Q (Lemma 50). In addition, we will 
introduce two more parameters. 

Recall that Gt is the hypergraph (i.e. configuration) remaining after t iterations of SLOW- 
STRIP on APr{n,m), and Hi is the hypergraph (i.e. conhguration) remaining after i — 1 
iterations of the parallel stripping process. Recall also that t{i) is the iteration of SLOW- 
STRIP corresponding to the beginning of iteration i of the parallel stripping process. So 
Gt(i) = Hi. Recall that r denotes the step when SLOW-STRIP terminates. 

Recall the dehnitions of L^, Dt, Nt, pt from Sections 4 and 4.4.3, and that (t = Dt/Nt. 
Recall that Pt = {(-h^, W,-Ds)}s<t- Our first new parameter is a very close approximation 
of Lt_|_i — Lt'. 

brt =-I + {r - l){k - l)pt. (69) 

This is a key parameter in analyzing the evolution of {Lt)t>o- If we view {Lt) as a 
branching process then 1+brt approximates the branching parameter of {Lt)] i.e. the expected 
number of new light vertices formed during step t of SLOW-STRIP. Note that for r = 2, this 
is exactly the expected number of new light vertices formed when the other endpoint of a 
deleted edge is heavy. But we need to account for (i) the possibility of both endpoints being 
light, and (ii) the fact that when r > 2 we typically remove multiple heavy vertex-copies. 
By (29) and (30), and noting that ht^i < Lt/{Lt + Dt — {r — 2)) = 0{Lt/n) (by Corollary 9), 
we have that a.a.s. for every t{B) <t<T, 

E{Lt+i - Lt I Pt) = brt + 0{Lt/n), E{Lt+i - Lt \ Pt) < brt + 0{n-^). (70) 

The second part of (70) above is applied when we only require an upper bound on E{Lt+i—Lt \ 
Pt). However, in some cases we need a lower bound as well, and we will use the first part. 

By (32), (33) and (34), there is a constant K > 0 such that 

a.a.s. for every t{B) <t<T, brt < (71) 

By (28), a.a.s. Ct = C + Q{n~^^‘^). By Lemmas 32 and 31 and considering the Taylor 
expansion of ^|J around a.a.s. 

Pr = V’(Cr) + 0(u“^/2logu) = V'(C)+'0'(C)(Cr-C)+O((Cr-C)^+U"^/^logu) = V'(C)-© (u'^^^). 
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By (21), "0(0 = l/(^ ~ 1)(^ ~ 1)) which implies that a.a.s. 


brT- = —Q{n 


(72) 


by (69). 

Our second new parameter, 7r{Gt), roughly counts the number of vertices in Gt which are 
not in the k-core and so will be removed before the process ends. Recall from Corollary 9 
that a.a.s. the /c-core has size an + {Ki + for a particular constant Ki > 0. So 

the number of non-/c-core vertices is simply \V{Gt) \ minus that number. 

We will prove (Lemma 52) that brt is the same multiplicative order as the negation of 
the proportion of vertices in Gt that are not in the k-core, until that proportion drops below 
j.j-5/2. point on, brt remains at —©(n”*^/^) (see (72)). To reflect this, we dehne 

7i{Gt) to stay at once the remaining number of vertices drop below that. So we 

dehne: 

7r{Gt) := \ V{Gt) \ - an - 

Over the next few subsections, we obtain tight bounds on brt in terms of Lt (Lemma 53); 
coupled with (70), these will yield our bounds on the expected change in Lt- To do so, we 
will link brt and lit, using (f Our hrst step will be to obtain a tighter bound on Q- 

6.1 Controlling Q 

We list below a few facts that we will use. Since c = Cr^k + n~^,5 < we can assume by 
Lemma 7 that there is a k-core on a linear number of vertices. At each step of SLOW-STRIP, 
we remove at most one hyperedge. Thus there is a constant Q = Q{r, k) such that in every 
step, the average degree of the heavy vertices is changed by at most ±Q/n; i.e. 

Ci+i — Ct = 0{l/n) uniformly for all 0 < t < r. (73) 

Therefore, for any e > 0, Lemma 15 allows us to choose sufficiently large B such that 
ICt ~ Cr| < e/2 for all t > t{B). Recalling the dehnition of ( from (12), we also know that 
a.a.s. ICr “Cl = o(l) by Corollary 9. Hence, for all t > t{B), ICi — Cl < ICt “Cr| + ICr — Cl < e. 
This immediately gives (Fa) below. 

(Fa) For every e > 0, we can choose B sufficiently large such that a.a.s. for all t > t{B), 

iCt-Cl <e. 

(Fb) A.a.s. for every t > t{B), pt = (l + 0(?7, iogn))'0(Ct)) by Lemma 32 and Theorem 6. 

(Fc) We can choose e > 0 sufficiently small so that there are Ci,C 2 > 0 such that —Ci < 
< —C2 for all X such that |a; — C| < e by Lemma 31 and since C = Cr,k > k. 

By (Fa) and (Fc), we may assume B is chosen so that 

(Fc’) —Cl < 'C^'(Ct) < —C 2 uniformly for all t > t{B). 

By (73) we have 
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(Fd) 0 = Ct + 0{{i — t)/n) uniformly for every 0 < f < i < r. 

We will apply Lemma 29 several times. To formalise its application, we need to define 
normal configurations as in Section 4.4.3, and define several stopping times when various 

а. a.s. events such as (Fa) and (Fb) fail. We skip such tedious settings of stopping times, and 
note that by the same treatment as in Section 4.4.3, we will only create an o(l) error in all 
probability bounds if we apply Lemma 29 with events in (Fa)-(Fd) assumed. 

In the next lemma, we prove a more precise form of (Fd). This is our strengthening of 
Lemma 35. 

Lemma 50. A.a.s. for every t{B) < t < i < t, 

(oj) Ci < Ct + 0{\ogn/n) uniformly for every 0 <t < i < t; 

(b) if i — t > logn, then Q = (t — — t)/n) uniformly. 

Proof. By Lemma 34, there exist two constants pi > P 2 > 0 such that a.a.s. for every 
t{B) < t < T 

n n 

Moreover, |Ct+i—Ct| = 0{l/n) by (73). We apply Lemma 29 with c„ = 0(l/n) and a„ = P 2 ln 
and with a similar argument as in the proof of Lemma 19 to show, for every i > t and j > 0, 

Pr(Ci > Ct - (* - ^)P 2 /ri +j) < exp(-f2(j2?7,V(*-t))), (74) 

Pr{Q<(ft-{i-t)pi/n-j) < exp(-f2(/u7(* - t)))- (75) 

Then by the union bound (by taking j = {i — t)p2/2n in (74) and taking j = {i — t)pi/2n 
in (75) for each t and each i >t + logn), we obtain (b). Part (a) follows by (b) and the fact 
that for each i <t + logn, we always have Ci = Ct + 0{\ogn/n) by (Fd). □ 

б. 2 Relations between Lt, 7r{Gt) and brt 

The next Lemma essentially says that if we can bound the expected change in Li then we 
can show Li is concentrated. Recall that r is the stopping time of SLOW-STRIP; i.e. the 
hrst iteration t for which L^ = 0. 

Lemma 16 says that there are constants K,B > 0 such that a.a.s. for every t{B) <t<T, 

E{Lt+,\J^t)<Lt-Kn-^^‘^. (76) 


It is convenient to consider a random process such that E(£t+i | Ct) ^ Ct — 

Kn~^B for all t > t{B). The process (Ct) can be dehned in various ways depending on each 
application and in many applications we will let Ct = Lt (or sometimes with a shift of the 
subscript) for all steps in which (76) holds (c.f. the proof of Lemma 19 and the arguments 
in Section 4.3.2). We hrst prove some a.a.s. properties of such processes. 

Lemma 51. Let (£i)i>o he a random process and define T to be the minimum integer such 
that Ct < 0. Suppose there is a constant C > Q and functions C > a = a{n) > b = b{n) > 
0(^-5/2) following bounds always hold for every i > 0.- 
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(i) log^ n < Cq< n; 

(a) \Ci+i - Ci\< C; 

(a) -a < E(£j +1 - Ci I < -b. 

Then, with probability at least 1 — o{n~^), 

(a) Cq — 2ai < Ci < Cq — ^bi for all log^'^ n < i < ; 

(b) Ci < 2 Cq for all 0 < i < ; 

(^) <T < |£o- 

Proof. We start with the upper bound in part (a). We will apply Lemma 29 with = 
Ci — Cq. So we can take = —b. By (ii) we can choose Cn = C. 

Consider i > log^'^ n. If Ci > Cq — \bi then Xn,i — Xnp = Ci — Cq > iOn + \ib. By 
Lemma 29 with j = ^ib, the probability of this is at most 

exp (- 2 ,(c+| 1|)2 ) £ exp(-Sl(t 2 i)) < exp(-Sl(log‘ =«)) = o(n-=>). 

The lower bound in part (a) is nearly identical, but this time we apply Lemma 29 with 
Xn,i = Cq — Ci, On = a and j = ia. Applying the union bound for the at most choices for 
i shows that (a) holds with probability at least 1 — o{n~^). 

For part (b): If i < Cq/C then the fact that Cj+i< Cj + C implies that Ci < 2Cq. If 
i > Cq/C, then i > log^'^n, then part (a) implies Ci < Cq. 

For part (c): Note that |£o > log^'^ n, and so part (a) implies that Ci reaches 0 for 
some i < |£o (noting here that |£o = o(n^) by assumption (i)); this yields the upper bound 
on T. For the lower bound on T, we apply the same argument used for the lower bound in 
part (a) with i = ^Cq but with j = \Cq. This time we get 

Pr(£i < ^£o) < exp(-f2(£Q/i)) < exp(-n(a£o)) = o{n~‘^), 
thus providing the desired lower bound on T. 

□ 


Our goal is to bound brt in terms of Lt. First we bound it in terms of 7i{Gt). 

Lemma 52. There are two constants Ci,C 2 > 0 such that a.a.s. —Ci7r(Gt)/n < brt < 
—C 2 'n{Gt)/n for every t > t{B). 

Proof. Recall that C,t denotes the average degree of heavy vertices in Gt- Let t' be the 
maximum integer such that 7r(Gt/) = where Ki comes from Corollary 9. Note 

that 7r(Gt) is a non-increasing function of t, which decreases by at most 1 in each step. 
Hence, for all t < t', vr(Gt) > and for all t > t', TiiGt) < Kpn}~^l‘^. 

By Corollary 9, a.a.s. 7r(G.r) ~ (JFi/2)n^“'^/^. Since for every t, |vr(Gt) — 7r(Gt+i)| < 1 as 
at most one vertex is removed in each step, we have that a.a.s. for alH < F, r — t > '^{Gt) — 
Ti{Gr)= Vt{Ti{Gt)). This is because n^Gr) ~ (iFi/ 2 )n^“^A and so ii^Gr) ~ \Ti{Gt') < ^7r{Gt) 
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for every t < t'. The constant bounds involved in the asymptotic notation above and below 
will be uniform for all t. In particular, t — t' > 7i{Gf) — 'n'iGr) > On the 

other hand, for every t, t — t < k{7r{Gt) — 7r(G^)) < kir^Gt), since every light vertex in the 
queue takes less than k steps to be removed. So a.a.s. uniformly for every t < t', we have 

< T— t' <T— t <kn{Gt)', (77) 

T-t = 0(7r(Gt)). (78) 

If t < t' then T — t= 0 ( 77 ,^“'^/^) > logn by (77). Thus by Lemma 50(b) we have Ct = 
Cr + ©(('^ ~ By (Fb) and (Fc’) and by (69), a.a.s. for all t{B) < t < t, 

bn - br^ = 0(l)(pt - Pr) = - '0(Cr)) = -0(l)(Ct - Cr), (79) 

and then by the fact that Ct = Cr + ©(('?’ ~ for fol t < t' and (78), a.a.s. for each t < t', 

brt - br^ = -0((r - t)/n) = -0(7r(Gi)/n). (80) 

The constants in the asymptotic notations above are uniform for all t < r. By (72), a.a.s. 
bvr = —0(n“^/^). Since 7r(G.r) ~ (iFi/2)?7,^“^/^, it follows then that a.a.s. for all t(i?) <t <t', 
brt = —0(7r(Gt)/n); note that is absorbed as Tr{Gt)/n > Tr{Gr)/n = 0(n“^/^) for 

every t. This proves that our lemma holds for all t < t'. 

Now we consider t > t'. Applying (77) with t := t', we obtain that a.a.s. t — t' < 

kniGt') = kKiin}~^/‘^. This implies that for alH' < t < r, r — t < r — F = 0(n^“^/^). Then 
by (73) we have 

c, - c, = o{n-'-r‘) (81) 

So by (79), brt — br^ = —0(Ct — Cr) = 0(n“^/^) for all t' < t < t. Then, we must have 

brt = —0(n“'^/^) for all F < t < r by (71) and (72). The dehnition of t' implies that 

7r{Gt)/n = 0(n“'^/^) for all t > t'. This implies that our lemma holds also for t > t'. □ 

And now we are ready to bound brt in terms of Lp 
Lemma 53. There are constants Di, D 2 , B > 0 such that a.a.s. for all t > t{B), 

(a) for all t such that Lt > , —Di^yLt/n < brt < —Di^/Lijn. 

(b) for all t such that Lt < < brt < —D 2 n~^^‘^. 

Combining Lemma 53 with (70), and applying a slight adjustment to Di,D 2 yields our 
strengthening of Lemma 16: 

Corollary 54. There are constants Di,D 2 ,B > 0 such that a.a.s. for all t > t{B), 

(a) for all t such that Lt > , —Di^/Lfjn < E(Lt+i — Lt \ Bt) < —D 2 \/Lt/n. 

(b) for all t such that Lt < < E(Lt+i — Lt \ Bt) < —D 2 n~^/‘^. 
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This strenghtening is what we need to prove Lemma 47(a,b). 

Proof of Lemma 53 We take B large enough so that the relevant preceding results hold. 

We have brt < for some constant K > 0, for all t[B) < f < r by (71). For any 

t{B) < t < T, we will consider i such that t < i < t. By Lemma 50(a), a.a.s. for any such 
pair of t,i, we have Q ^ Ct + 0{\ogn/n). Thus by (Fb) and (Fc’) and the definition of bvi, 
a.a.s. for all pairs of t, i such that t{B) < t < i < t, 


bn = brt - Q{l){pt - Pi) = brt - = brt - 0(1)(O - Ct) 

' log n' 


> brt — Q 


n 


> ^brt, 


since \ogn/n = o{\brt\). By (70), a.a.s. for every t{B) < t < i < r, 


2brt - 0{Li/n) < E(Li+i - Lj | Bf) < brt + 0{n ^). 

We know that a.a.s. for every t{B) < t < i < r, we have brt, brt < by (71). So for 

some constant A > 0, we have a.a.s. for all t{B) < t < i < t: 

2brt - ALi/n < E(Li+i - Li \ Bf) < (82) 

Let ti be the smallest t such that Lt+i < log^ n. We first prove the lemma for all 
t < ti. 

Note that since brt ^ (by 69), we can take B large enough so that for i > B, Lt/n 
is small enough that the LHS of (82) is at least -3. So we let r* denote the first step that 
the condition —3 < E(Lj_|_i — Lj | J7) < fails; define r* = r if it never fails. 

Assume t < B. Define by letting Cq = Lt and = L* for each t < i < r* and 

Ci+i-t = Li-t — for all i > r*. Moreover, (82) implies that a.a.s. Ci-t = Li for all 

t < i < T. 

Now the process (£*) satisfies the hypotheses in Lemma 51(i) (by the definition of fi), 
(ii) with C = kr and (hi) with a = 3 and b = . Now by Lemma 51(b) (and by taking 

the union bound over t), a.a.s. Li < 2Lt (corresponding to Ci-t < 2 Cq for each given t) for 
all pairs of i,t such that t{B) < t < i < t. Then (82) becomes 

2brt - 2ALt/n < E(Lj+i - Li \ Ti) < (83) 

So we can apply Lemma 51 again to (£i)i>o defined as above (except that we modify r* 
to reflect (83) rather than (82)) with a = —2bfrt + 2ALt/n. Letting T denote the first 
step that Ci becomes at most zero. Lemma 51(c) shows that with probability 1 — o{n~^), 
T > 4ALt/{\brt\ + Lt/n)] i.e. r > t + 4:ALt/{\brt\ + Lt/n). Taking the union bound over 
t{B) < t < ti and using that by (83) a.a.s. Ci-t = Lt for all t{B) < t < i < r shows that 
a.a.s. for all t{B) < t < tp. 

T >t + AALt/{\brt\ + Lt/n). 

We delete at least one vertex for every k — 1 steps of SLOW-STRIP, and so \V{Gt) \ 
V{Gr)\ > [r — t)/{k — 1). Therefore, applying Corollary 9, we have that a.a.s. for all 
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t{B) 




3 


> 




_ + 

(k-l){\bn\ + L,/n) ^ 3 


By Lemma 52, there exists a constant G 2 > 0 snch that a.a.s. for all t{B) < t < ti, 


bn < -6*2^^ < -6*2 


n 


4^ALt/n :^„- 5/2 

ik-l)i\bn\+Lt/n)^ 3 


and so 


\brt\ > G 2 max 


AALt/n 


Ki _ 


-n 


<5/2 


(84) 


(85) 


{k - l)(|6rt| + Lt/nY 3 

Taking B large enongh that Lt/n is snfficiently small for t > t{B) (by Lemma 15), and 
rearranging (85), there is a constant D 2 > 0, snch that a.a.s. for all t{B) 


\brt\ > D 2 max 





n 


( 86 ) 


This yields the npper bonnds in onr lemma for t <ti. Next we prove the lower bonnds; i.e., 
we wish to prove that for some constant Hi > 0 for all f(i?) < t < i < ti: 


\brt\ < Di max 



(87) 


The proof is similar to that for the npper bonnd so we briefly describe the argnments. 
Let A 2 , A^> 0 be the implicit constants in (Fd), (79), and set Ai = 17 ( 2 ^ 2 ^ 3 ). Apply¬ 
ing (79), (Fd) we get that a.a.s. for any t{B) <t<i< min{f -|- Ai| 6 rf |n, r}: 

bn - bn < AYCt - Ci) < MAYY - t)/n) < AxA 2 AYbn\ = 

where the hrst ineqnality above comes from (Fb), (Fc’) and (69) (in the same way that (79) 
is dednced), the second ineqnality follows by (Fd) and the third ineqnality follows by the 
npper bonnd of i; the eqnality holds by the dehnition of Ai. Thns, bvi < \brt. So, by (70): 

¥.{Li+i\ Bi) - Li <-brt +0{l/n), Vf(i?) < f i < min{fAi| 6 rt|n, r}. ( 88 ) 

We can assnme | 6 rt| > (Y>/Ai)Lt/n, as otherwise (87) holds with Hi = ^JsJA[. Therefore 
we have (i) 8 Li/| 6 ri| < Ai| 6 ri|? 7 , and (ii) the RHS of ( 88 ) is at most \brt (noting that 
\brt\ » 1/n by (71)). 

Dehne (Hi)i>o analogonsly to how it was dehned above, and apply Lemma 51(c) with b = 
jbrt (and with a = —2bn + ‘2ALt/n, as argned above). By the a.a.s. correspondence between 
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and L*, and noting that (i) above allows us to apply (88) for all i < min{t + 8Lt/|6rt|, r}, 
we can show that with probability 1 — the stopping time 


T <t + -Lt = t + 8 Lt/\brt\. 
b 


(89) 


Taking the union bound over t{B) <t<ti shows that a.a.s. (89) holds for all t{B) < t <ti 
for which \brt\ > y^{ 8 /Ai)Lt/n. 

Since we remove at most one vertex during each iteration of SLOW-STRIP, it follows 
that Gt contains at most 8 Ltl\brt\ non-/c-core vertices by (89). Recalling that vr(Gt) is 
approximately the number of non-/c-core vertices in Gt plus this implies that 

7 r(Gi) < A^{Lt/\brt\ + for some constant A 4 > max{8, Now, by Lemma 52, 

we have that there is a constant Gi > 0 such that a.a.s. for all t(B) < t < ti with \brt\ > 
Vi8/Ai)Lt/ n we have: 


bn > -GMGt)/n > -G1R4 




This implies ( 87 ) for t{B) < t < ti with Di = m.a,yi{^J8JA[^ G1A4]. 

Now we consider t > G. Since Lt^ > n^log^n, by ( 89 ), a.a.s. t — G < 8Ltj/|6rt^|. By 
the definition of ti we also have that + r < 2n^log^?7,. (86) says that a.a.s. 

\brti \ > D2n~^^‘^. So we obtain: a.a.s. for all t > G, 

T — t < T — G < (16/^2)?^^'^^^ log^ n. 

This implies that a.a.s. for all ti < f < r, t — ti < (16/1)2)?^^'^^^ log^n. Then, recalling that 
yl2,^3 are the implict constants from (Fd) and ( 79 ), and since \brt^ \ > D2n~^^‘^ (from (86)), 
a.a.s. for all G < t < t, 


\bn - bnA < A 2 A^{t - G)/n < A 2 A^{lQ/Di)n^^/‘^ ^ log^n = o{n = o(6nJ, 

as 5 < 1/2. Since (86), (87) hold for t = G then by increasing Di and decreasing D 2 , these 
bounds hold for t > ti, and hence for all t{B) <t<T. □ 


6.3 Proof of the key lemma 

And now we can prove Lemma 47, which we restate. 

Recall that we are carrying out SLOW-STRIP on APr{n,m), and so implicitly, this 
yields the steps of the parallel stripping process applied to APr{n, m). /max is the number of 
iterations carried out by the parallel stripping process, and Si is the set of vertices removed 
in iteration i. The number of hyperedges removed in iteration i that contain u is dA{u) for 
each u E Si, and is d~{u) for each u G Rj+i (if i < /max)- 

Lemma 47. There exist constants B,Yi,Y 2 , Zi dependent only on r,k, such that a.a.s. for 
every B < i < /max with IR/ > log^ n: 

(a) if \Si\ < then (1 - Yin~^^‘^)\Si\ < IRj+il < (1 - Y2n-^/'^)\Si\-, 
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(b) if 1^,1 > then (1 - < |^ml < (1 - Y,^\^)\S,\-, 

(c) E,>^\S,\ < Z^\S,\nY\ 

(d) l^.l < E. 6 S. d-i^) < 1 5 .1 + + log^n; 

(e) |5'j+i| < 'Yl,a,b^^^T^ — 1‘5'i+il + Zi-^^ + log^n; 

(f) T.a>2,b<r-a + log^ n' 

(g) E«e 5 i d+{u)d-{u) < Y.ueSi d^{u) + Zi^ + log^n; 

(h) 

(i) d~{u) < logn for all u G 

Proof. We take B large enough so that the relevant preceding results hold. 

Part (i): We allocate rcn vertex-copies to n bins. So the probability that at least one 
bin has size at least logn is at most nPr[Bin(rcn, d) > logn] which is easily computed to 
be o(l). Part (i) follows since d~{u) is less than the size of bin u. 

Part (d): We hrst prove (d) assuming (e). The hrst inequality follows since d {u) > 1 for 
all u E Si (with i > 2). For the second inequality, note that EueSi d~{u) = E„>, < 

and so the bound follows from Lemma 47(e) and the fact that a.a.s. |5'j+i| = 
©diSjl) by Lemma 47(a,b). 

Part (f): Ea> 2 , 6 <r-a J2a>2,i<b<r-a K’’' = , where Y is the number 

of hyperedges (i.e. r-tuples) that contains at least two vertex-copies in Si and at least one 
vertex-copy in S'j+i. 

Consider SLOW-STRIP from step t{i) to t{i -|- 1) — 1, i.e. the steps when the vertices in 
Si are removed. When SLOW-STRIP removes a hyperedge incident with n G S'*, it removes 
a vertex-copy from u and another r — 1 vertex-copies chosen uniformly at random. This edge 
counts towards Y only if at least one of these r — 1 vertex-copies are from S'*. Regardless of 
what happened during the removal of previous vertices from Si, there are at most (/c — l)|S'j| 
remaining vertex-copies from Si, and there are a total of ©(n) available vertex-copies (by 
Corollary 9). So the probability that at least one vertex-copy is selected from Si is at most 
0{\Si\/n). We remove at most {k — l)|5'j| hyperedges during this phase. So Y is dominated 
in distribution by a binomial variable Bin((/c — l)|«S'd, 0{\Si\/n)) and so the Chernoff bound 
yields that with probability at least 1 —n~‘^, Y = OdFil • IFjl/n-|-logn). Multiplying by the 
at most n choices for i completes the proof for part (f). 

Part (e): The lower bound is trivial as every vertex in S'j+i is incident with at least one 
hyperedge counted by Eab^Tf’^, and each (a, 6 )-hyperedge contains b vertices of S'j+i. We 
now prove the upper bound. 
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Let A denote the sum of d (m) — 1 over all u G S'j+i with d (u) >2. We note that 


a,b 


< 


b a>2,b<r—a 

'Y, d~{u) + 0{\Si\'^/n) by part (f) 

ue5i+i 

lAj+il + A + 0{\Si\'^/n) since d~{u) > 1 for all u G S'j+i 


(90) 

(91) 

(92) 


To bound A, we bound Xj which is defined to be the number of vertices u G Aj+i with 
d~{u) > j. To do so, we expose the degree sequence of the configuration remaining after i 
iterations of the parallel stripping process, and then consider choosing the configuration using 
Bollobas’ configuration model. We emphasize that this exposure of the degree sequence, and 
choice of the conhguration are only carried out for the purposes of the proof of (93) below; the 
exposure does not carry on outside of this lemma - in particular to the analysis in Section 5.4. 

If a vertex u is counted by Xj, then we must have j < deg(M) < k — 1 + j, where deg(M) 
is its degree in the remaining configuration, and j copies of u must be selected for removal. 
At most (r — l){k — l)|S'j| vertex-copies are randomly selected for removal, and there are at 
least an vertex-copies to choose from (by Corollary 9). So the probability that u is counted 
by Xj is at most 


l^deg(u)^ - l){k - l)\Si\ y ^ ^ - 1 + - l){k - l)\Si\ y ^ 

for a suitable constant Z = Z{r,k). 

The probability that u, u' both count towards Xj are negatively correlated - if u counts 
towards Xj then at least j copies are chosen from u and hence are not chosen from u'. 
(Note: the reason that Xj is defined to be the number of u G S'j+i with d~{u) > j rather 
than d~{u) = j is that it makes this negative correlation easier to see.) Noting that there 
are at most n vertices with j < deg(M) < k — l+j, this implies that Xj is dominated by the 

binomial BIN{n, 2~^ (^Z^^ )■ Applying the Chernoff Bound and summing over all choices 
of j implies that with probability at least 1 — 


for all j > 2: Xj < n (93) 

(93) implies that A < = 0{\Si\‘^/n + \ogn). Multiplying by the at most 

n choices for i completes the proof for part (e) and hence of part (d). 

Part (g): Here, we focus again on A from part (e), this time using the value from 
iteration i — 1. So A is the sum of d~{u) — 1 over all u E Si with d~{u) >2. As we showed 
in part (e), a.a.s. A < Z\Si\‘^/n + log^n (for every i). Part (g) now follows since: 

Y^ d'^{u)d~{u) = Y^ d'^{u) + Y^ d'^{u){d~{u) — 1) < d'^{u) + {k — 1)A. 

udSi udSi udSi udSi 
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For the remaining parts, we focus on: 


Li = ^^ d^{v)] i.e. Li = 

v&Si 


We prove the following relation between Li and \Si\: a.a.s. for all i under the assumptions 
of this lemma, 

L, = {k- 1)|5,|(1 + 0{\S,\/n + log^u/IF,!)). (94) 

Since d'^{v) < k — 1 for every v G Si, immediately we have L^ < {k — l)|5'j|. If a vertex v 
in Si has d'^{v) < k — 1 then d~{v) must be at least two since otherwise v should have been 
removed before the i-th iteration of the process. Let I 2 denote the set of vertices u G S'* with 
d~{v) > 2 then 

U > ik-l)\S,\l2\. 


By (d), 

1^* \ / 2 I + 2 I/ 2 I < d-{u) < |5,| + 0{\Si\yn + log' n). 

uGSi 

It yields I/ 2 I = 0{\Si\‘^/n + log'u) and so \Si \ / 2 I = IFjl + 0{\Si\‘^/n + log'u) = IF^Kl + 
0{\Si\/n + log'n/IFjl)). It follows then that Li > (k — l)|S'j|(l + 0{\Si\/n + log'n/|S'j|)). 
Combining with the upper bound that Li < (fc — l)|S'j|, (94) follows. 

Parts (a,b): For (a), we have Li = r2(n^log'n) and Li = 0{n}~^) by the assumptions 
of the lemma and by (94). We will bound E(Lj+i — Lj \ J^j) in order to apply Lemma 51 as 
in the proof for Lemma 53. 

Since SLOW-STRIP removes at least one and at most r vertex-copies from Si in every 
iteration t{i) < j < t{i + 1), we have t{i 1) — t{i) = Q{Li), uniformly over i satisfying 
the conditions of the lemma. So -I- 1) > t(i) -|- n^log^'^n. Furthermore, by (Fd), for all 
t{i) < j < t{i -h 1) - 1, Cj - Ct{i) = 0{{j - = 0 {Li/n), and so by (79), we have 

bvj = brt(i) -|- 0{Li/n) = brt[i) -|- 0{n~^). Corollary 53(b) says that a.a.s.: 


- < E(Lj+i - Lj I JF,) < -D2n-^l‘^, 


(95) 


for all t{i) < j < t{i -|- 1) — 1. This allows us to apply Lemma 51 in a similar manner as in 
the proof of Lemma 53. I.e. we define a process Cj that is equal to Lj+t(i) so long as (95) 
holds, and such that we can apply Lemma 51 to Cj] then we translate what this says about 
Cj to Lj since (95) a.a.s. holds for all j. This yields that for each i satisfying the conditions 
of the lemma, with probability 1 — o{n~^), 

Lt(i+i) = Lt(i) - 0 (n"'^/')(t(i + 1 ) - t{i)) = Lt(i)(l - ©(n""^/')). 

Taking the union bound over all i yields part (a) with IFil replaced by Li. Then part (a) 
follows by (94) by noting that dominates the other errors \Si\/n and \o^ n/\Si\ for |S'j| 
in this range. 

The proof of part (b) is nearly identical, applying Lemma 53(a) rather than Lemma 53(b). 

Part (c): Recall that r is the stopping time of SLOW-STRIP. As in the proofs of parts 
(a,b), we will apply Lemma 51. There are two possible ranges for Lj and in both, we can 


take b = if Lj < ■n}~^ we use b = if Lj > we use b = Q{y^LjJn) > 

0(j^-(5/2)_ Lemnia 51(c) yields that with probability 1 — o{n~^), Ct < Cq + for 

each t > log^'^ n, which implies (since w.h.p. Lt(i)+t = until step r when Ct drops to 
zero) T — t{i) < Q{n^^‘^)Li. 

Since the total degree in decreases by at most r in every iteration of SLOW-STRIP, 
we have 

Lj < r(r — t{i)) < Q{n^^‘^)Li. 
j>i 

Taking the union bound over all i and then applying (94) yields part (c). 

Part (h): By (93) applied to iteration i — 1 and using parts (a,b), we have that with 
probability at least 1 — n~‘^ there is a constant Zi such that: 

5^(d-(u))' < 1^*1 + - l)Xj < 1^,1 + O(l^.-il) < Z,\s,\. (96) 

u£Si j>2 


Part (h) then follows by multiplying the failing probability of at most n ^ by the at most n 
choices for i. 

□ 
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Appendix 

Proof of Lemma 8. First we bound y := /i(c) — fir,k- Recall that yu(c) is the larger root of 
h{fi) = c and yr,k is the unique root of h{y) = Cr,k- As h{x) is convex over x G (0, +oo) and 
has derivative 0 at x = yr,k- The Taylor expansion at p = yr,k gives 


c = h{fir,k) + ^ ^ 


T,fc 


+ 




r.ki 


■y\l + 0{y)). 


Thus, y = ^y2/h"{yr,k)n + 0(n ^) = Kin + 0(n by letting Ki = y^2/h"{yr,k)- 

Next, we bound 


/fc(/i(c)) - ar,k = fk{p{c)) - fk{yr,k) = fkiLr,k)y + 0{y‘^). 


Recall that 

fk{x) = e-^^x^/i\. 

i>k 

Thus, /fc(x) = e~^x^~^ /{k — 1)!. Hence, fk{yr,k) > 0. It follows then that there is a constant 
R 2 > 0 , such that 

/fc(/i(c)) -a = K 2 n~^/^ + 0{n~^). 

Similarly, there is a JF 3 > 0 such that 

V(c)/fc_i(p(c)) - /3 = + 0{n~^). □ 

r 

Proof of Lemma 25. By the definitions of yr,k and h{y) from (4), h'{fir,k) = 0. Since 

, _ fk-i{xY~^ - x{r - l)/fc_i(x)’'-2/Li(a^) 

/,_i(x) 2 h-i) 

and /fc(x) = fk-i{x) — fk{x) for all /c > 1 , we have 

fk—l{pr,k) Pr,k(L 1) (/'fc—2(/^r,A;) fk—l(^Pr,k')Y 


i.e., 

fk-2{Pr,k) ^ 1 + yr,kY - 1) ^ ^ 1 

fk—l(.Pr,k') Pr,k(j' 1) Pr,k(j' 1) 

On the other hand, 

u ^-2 

fk-2{Pr,k) = fk-l{Pr,k) + 

it follows immediately that 

uf~‘^ 1 

(.-Mr.fc ^ 

i^k 2)!(Pj. Pr,k(L i-) 

By the definition of pr,k{k), a and /3, the assertion follows thereby. 


(97) 

□ 
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Proof of Lemma 27. Set /(x) = e^fk{x),h{x) = xe^fk-i{x). So we wish to show that 
f'{x)h{x) < f{x)h'{x). 


i>k 




/W = E-r. 


i>k 


X 


i>k 


i>k 


f{x)h'{x) = 


i,j>k 




f'{x)h{x) =J2 t: 


X 


i+j-l 


i,j>k 




2i—l 

When i = j, the contribution to each sum is the same: . When i ^ j, consider 

the contribution of {i,j) plus the contribution of The sum of these contributions to 

f{x)h'{x) and f'{x)h{x) is 




l)!j! 


+ 






2 


The contribution to f{x)h'{x) is larger since 4 + 4 > 2. □ 

Proof of Lemma 31. Let h(A) = e“'^A^“^//fc_i(A)(/c — 1)!. Recall ( 7 ^( 2 ^) above Lemma 27 and 
recall the dehnition of 'ijj{x) in (19). Then, 


= h{9k\^))^ 

where g'jf^ is the inverse of gk. It is easy to see and we have mentioned before that 
lima;_,.o= k. So for every x > k, g]f^{x) exists, and by Lemma 27 and the chain 
rule, the derivative of g'^^{x) is positive. Thus, in order to show that fj'^x) < 0, it is 
sufficient to show that for every A > 0, h'{\) < 0. 

Showing that h'{\) < 0 is equivalent to showing that for every A > 0, 


(^-i)Ef-^Ef 

* 7'>]b_9 * 


yk-^l 


-A + 1 




yj-fc+2 


is negative. It is trivially true if A > fc — 1. Now assume that A < k — 1. Since for every 
j > fc — 1, we have 


A^' 


-k+2 


< 


A 


j-k+2 


[j]j-k+2 \k-l 
where the inequality is strict except for j = k — 1. Thus, 

\j-k+2 


-A + (fc -1 - A) 


j>k-l 


[Jh- 


k+2 


< -A + (fc-l-A)^ 


i>o 


A 


k-l 


i+i 


= -A + (fc-l-A) 


A 


^-1 1 - 7 ^ 


= 0 . 


This completes the proof that h'{X) < 0. 


□ 
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We need the following technical lemma before proving Lemma 24. 

Lemma 55. For all k,r >2 with {r,k) ^ (2,2), h{x) < l/(r — 1) for all x > x* where 


h{x) = 


g— 1 


fk-i{x){k - 2)!’ 


and X* = r{k — 1) — 


r — 1 


Proof. It is easy to see that h{x) is a decreasing function on x > 0 (dividing e~^x^~^ from 
both the numerator and the denominator and then the numerator is constant whereas the 
denominator is an increasing function of x). Hence if we can prove h{x) < l/(r — 1) for some 
X < X* then we are done. 

We first prove a well-known inequality with respect to fki^h)- 
Claim 56. For any integer k < [/ij, /^(/i) > 1/2. 

Proof. Let M be the maximum integer such that M < p — 1. Then we must have k — 1 < 
M — 1. By the dehnition of fkih^), we only need to prove that 


E 



M-l 


i=0 



< 1 / 2 . 


Let p{i) = e /i\] it suffices then to show that for every 1 < f < M, p(M — i)< p{M -|- i). 
This follows easily from 


p{M — i) 
p{M + i) 


i 


n 

j=-i+i 


M + j 


i 

^ n 

j=-t+i 



< exp 




□ 


Now we continue to prove Lemma 55. We split our discussion into two cases. In both, we 
will reduce the lemma to checking a hnite number of pairs {k,r), which is straightforward. 

Case 1: k = 2, r > 3. Now h{x) = e~^x/{l — e~^) and x* = r — r/(r — 1). By computing 
the derivative of (r — 1) ■ h{x*{r)) with respect to r, it is easy to see that h{x) is a decreasing 
function on r > 3. Hence it suffices to verify that h{x*{3)) <1/2 which can be easily done. 

Case 2: k >3. We may easily verify the lemma for {k,r) = (3,2). So we assume that 
{k,r) 7^ (3,2). Let Xi = r{k — 2). Clearly xi < x* and [xij > k — 1. By Claim 56, we have 
fk-i{xi) > 1/2. Since h{x) is decreasing on a; > 0, it suffices to prove that h{xi) < l/(r — 1) 
and thus it suffices to prove that e~^^x\~^/{k — 2)! < Define 


0(x,r) 


(j,^)x+l 


x\ 


Then, e ^^x^ ^/{k — 2)\ = cfik — 2, r) and so it suffices to prove 

(r - l)0(/c - 2,r) < ^. (98) 

By computing the derivative of (r — l)-0(a:, r) with respect to r, we see that 

for (r — 1)^ > r"^/[x -|- 1) : (r — 1) • cf){x,r) is non-increasing on r. (99) 
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It is easy to see that 


4 >{x + 1, r) 
0(a:, r) 


e '’(rx + 

(x + l)(rx)^+^ 


re-" < ^g-r+l+l/x < ^e-r+2^ 


which is strictly less than one for any x > 4 and r > 2 . So 


for X > 4, r > 2 : (r — 1) • 0(x, r) is decreasing on x. (100) 

k > 6: We can easily verify that 0(4,2) < 1/2, and so by (99), this establishes (98) for 

k = 6,r > 2. Then (100) establishes (98) for all A; > 6,r > 2. 

k = 5: We can easily verify that 20(3,3) < 1/2, and so by (99), this establishes (98) for 

k = 5,r > 2. 

/c = 4: We can easily verify that 30(2,4) < 1/2, and so by (99), this establishes (98) for 

k = 4,r > 3. For r = 2, we verify the lemma directly from the dehnitions of h{x) and x*. 

k = 3: We can easily verify that 50(1,6) < 1/2, and so by (99), this establishes (98) for 
k = 3,r > 4. For r G {2,3}, we verify the lemma directly from the dehnitions of h{x) and 

X*. 

□ 


Proof of Lemma 2f. We hrst verify that ( > k. Recall that gk{x) = xfk-i{x)/fk{x). By 
dehnition of C,, a and (3 in (12) and (5), 


C 9k{Pr,k)- 


It is easy to see that hma;^o5'fc(2^) = k. Lemma 27 says that gk{,x) is strictly increasing on 
X > 0. It is easy to see that for any (r, fc) ^ (2,2), hr,k{p) in (3) tends to inhnity both 
when /i —)■ 0 and when /i —)■ cx). This implies that g,r,k is a positive real number. Thus, 
gk{^pr,k) ^ k. 

Next we prove that C, < r{k — 1). By (97), 


3 /-^r,k . 


k-1 

r^r.k 


(/c - l)!/fc_i(/ir,fc) (/c-l)(r-l) 


( 101 ) 


As we said above, ( = gk{pr,k) and so by (11), C = Pr,kfk-i{Pr,k)/fk{pr,k)- Hence, ( < r{k-l) 
is equivalent to 

fkipr,k) ^ 1 

pr,kfk—l(^pr,k) 'f'{k 1 ) 

Note that fkiPr,k) = fk-i{Pr,k) ~ Pr~k^ /~ I)') Combined with (101) the left hand side 
above is equal to 


1 

pr,k 



g k'r,k 


Pr,k 


{k - l)\fk-l{iar,k) 


1 

Pr,k 



1 

{k — l)(r — 



Hence C < ''"(A; — 1) is equivalent to 


(A; - l)(r - 1)) ^ r(A: - 1) 


i.e. Hr^k < k* '■= ^(k — 1) 


r 


r — 1 


( 102 ) 
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We consider the function h{x) from Lemma 55; i.e. 


e 7(^-2)! 

“ h-i{x) • 


By (101) (and multiplying A; — 1 on both sides), h{^r,k) = !/(?’ — !)• 
I^r,k < thus establishing (102) and hence the lemma. 

So by Lemma 55, 
□ 
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